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PATENT 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Atty.Dkt.No.: 4100.000582 

Prior Application Serial No.: 
08/479,722 

Prior Application Examiner: 
David L. Fitzgerald 

BOX PATENT APPLICATION Prior Group Art Unit: 1646 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

REQUEST FOR FILING CONTINUATION 
APPLICATION UNDER 37 C.F.R. 8 1.53(b) 

This is a request for filing a continuation application under Rule 53(b) (37 C.F.R. 
§ 1.53(b)) of co-pending prior application Serial No. 08/479,722 filed June 7, 1995, entitled 
"Latent TGFp Binding Protein (LTBP) Genes, Compositions and Methods", now amended to 
read "Recombinant Production of Latent TGFp Binding Protein-3 (LTBP-3)", now entitled 
"Methods of Using Latent TGFp Binding Proteins". 

X] 1. Enclosed is a copy of the prior application Serial No. 08/479,722 as originally 
filed, including specification, claims, drawings, and declarations (properly 
submitted after the filing date). The undersigned hereby verifies that the attached 



IXI 2. 



FOR 



papers are a true copy of the prior application as originally filed and identified 
above, that no amendments (if any) referred to in the declaration filed to complete 
the prior application introduced new matter therein, and further that this statement 
was made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment or both, under Section 1001 of Title 18 of 
the United States Code, and that such willful false statement may jeopardize the 
validity of the application or any patent issuing thereon. 

(a) [X] The inventorship is the same as prior Application Serial No. 

08/479,722. 

(b) Q Deletion of inventor(s). Signed statement attached deleting 

inventor(s) named in the prior application, see 37 C.F.R. § 1.63(d)(2) 
and 1.33(b). 

Enclosed is a check in the amount of $654.00 to cover the filing fee as calculated 

below and the fee for any new claims added in the Preliminary Amendment 

referred to in Clause No. 7 below. 

CLAIMS AS FILED IN THE PRIOR APPLICATION 
LESS CLAIMS CANCELED BELOW 



NUMBER 
FILED 



NUMBER 
EXTRA 



RATE 



FEE 



Basic Fee 



Total Claims 37 - 20 = 

Independent 7 - 3 = 

Claims 

Multiple Dependent Claim(s) 



17 X 
4 X 



$9.00 
$39.00 



$345.00 

$153.00 
$156.00 

$0.00 



TOTAL FILING FEES: 



$654.00 



3. Applicant is entitled to Small Entity Status for this application. 

(a) A small entity statement is enclosed. 

(b) A small entity statement was filed in the prior nonpro visional application and 
such status is still proper and desired (copy attached). 

(c) Small entity status is no longer claimed. 

4. If the check is missing or insufficient, the Assistant Commissioner is hereby 
authorized to charge any fees under 37 C.F.R. §§ 1.16 to 1.21 which may be 
required for any reason relating to this document, or credit any overpayment to 
Williams, Morgan & Amerson, P.C. Deposit Account No. 50-0786/4100.000582. 

5. Enclosed is a copy of the former and current Power of Attorney in the prior 
application. 

6. Address all future communications to: 

Shelley P.M. Fussey 

WILLIAMS, MORGAN & AMERSON, P.C. 
7676 Hillmont, Suite 250 
Houston, Texas 77040 
(713) 934-7000 

7. The prior application is presently assigned to The Regents of The University of 
Michigan (copy of Assignment attached). 

8. Enclosed is a preliminary amendment (including Exhibits A through F). Any 
additional fees incurred by this amendment are included in the check at No. 2 
above and said fee has been calculated after calculation of claims and after 
amendment of claims by the preliminary amendment. 
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^ 9. Cancel in this application claims 1-39, 41 and 42 (ie. 9 all claims except claim 40 ) 
of the prior application before calculating the filing fee. (At least one original 
independent claim must be retained). 

□ 10. Amend the specification by inserting before the first line the sentence: -This is a 
continuation of co-pending application Serial No. filed . 

3 11. Enclosed are formal drawings of FIG. 1 through FIG. 8C on a total of 8 sheets. 

XI 12. The present paper constitutes Applicants' compliance with their duty of disclosure 
under 37 C.F.R. § 1.56. Please make all prior art of record in parent application 
Serial No. 08/479,722 of record in this case. As a courtesy, the following is 
enclosed: 

^ (a) Copies of PTO- 1 449s from prior application. 

fl (b) Copies of IDS citations. 

[3 13. Transfer the sequence information, including the computer readable form 
submitted on August 24, 1999 in the parent application, Serial No. 08/479,722, 
for use in the application. 37 C.F.R. § 1.821(e). As a courtesy, a paper copy is 
enclosed. 



X] 1 4. Return Receipt Postcard. 



Respectfully submitted, 




WILLIAMS, MORGAN & AMERSON 
7676 Hillmont, Suite 250 
Houston, Texas 77040 
(713) 934-7000 
Date: June 12, 2000 



Shelley P.M. Fussey 
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Patent Agent 
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DECLARATION CLAIMING SMALL ENTITY STATUS 
37 C.F.R. §§ 1.9(f) and 1.27(d) - NONPROFIT ORGANIZATION 



Assistant Commissioner for Patents 
Washington, D.C. 20231 



Sir: 



I hereby declare that I am an official empowered to act on behalf of the 
nonprofit organization identified below: 



Address of Organization: 3003 South State Street 

Wolverine Tower, Room 2071 
Ann Arbor, Michigan 48109-1280 

The type of organization is a university. 

I hereby declare that the organization identified above qualifies as a nonprofit 
organization as defined in 37 C.F.R. § 1.9(e) (1), and thus is a "small entity" as 
defined in § 1.9(f), for purposes of paying reduced fees under Sections 41(a) and (b) 
of Title 35, United States Code, with regard to the above-referenced application. 

I hereby declare that exclusive rights to the invention have been conveyed to and 
remain with the organization, with respect to the above-referenced invention, nor have 
I assigned, granted, conveyed or licensed and am under no obligation under contract or 
law to assign, grant, convey or license, any rights in the invention to any person who 
could not be classified as an independent inventor under 37 CFR § 1.9(c) if that person 
had made the invention, or to any concern which would not qualify as a small business 
concern under 37 CFR § 1.9(d) or a nonprofit organization under 37 CFR § 1.9(e), with 
the exception that the Government may have rights in the invention pursuant to a 
funding agreement under 35 U.S.C. § 202(c)(4): NIH Grant No. HL-41926. 

I acknowledge the duty to file, in this application or patent, notification of 
any change in status resulting in loss of entitlement to small entity status prior to 
paying, or at the time of paying, the earliest of the issue fee or any maintenance fee 
due after the date on which status as a small entity is no longer appropriate. 

I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further 
that these statements were made with the knowledge that willful false statements and 
the like so made are punishable by fine or imprisonment, or both, under Section 1001 
of Title 18 of the United States Code, and that such willful false statements may 
jeopardize the validity of the application, any patent issuing thereon, or any patent 
to which this verified statement is directed. 



Name of Organization: 



The Regent of the University of Michigan 



THE REGENT OF THE UNIVERSITY OF MICHIGAN 




Name: Robert L. Robb 



Title: Director of Technology 
Management Office 
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PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Prior to examination, the Examiner is respectfully requested to enter the following 
amendments. Remarks supporting patentability of all claims are also included, which the 
Examiner is respectfully requested to consider. In light of the issuance or allowance of the three 
earlier applications to which priority is claimed, examination and consideration with a view to a 
timely allowance is respectfully requested. 
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AMENDMENT 

In the Specification: 

Please delete the title from the parent application and replace with the new title 
-- METHODS OF USING LATENT TGFp BINDING PROTEINS --. 

At page 2, line 5, after "30, 1994", please insert -, which issued as U.S. Patent 
No. 5,942,496 on August 24, 1999 --. 

At page 2, line 6, after "18, 1994", please insert -, which issued as U.S. Patent 
No. 5,763,416 on June 09, 1998 -. 

At page 21, beginning at line 24 and extending through page 22, line 15, please delete the 
entire text beginning with "FIG. 9." and extending through "40091 antiserum." 

At page 48, line 9, after the paragraph that ends with "manner.", please insert the 
following new paragraph - Accordingly, an LTBP protein or polypeptide may be provided to a 
repair tissue site or bone progenitor tissue site. A nucleic acid segment (DNA or RNA) that 
expresses an LTBP protein or polypeptide in cells of the tissue site may be provided, as may a 
nucleic acid segment in association with a structural biocompatible matrix (U.S. Patent No. 
5,942,496 and U.S. Patent No. 5,763,416). -. 

At page 75, line 7, after "pLTBP-3fl" please insert -- , deposited November 24, 1997 with 
the American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD 20852 (now at 
10801 University Boulevard, Manassas, VA 20110-2209), and given the ATCC Accession 
numbers ATCC 209496 -. 

At page 75, line 11, after "1995)", please delete "(FIG. 9)". 

At page 75, at blank line 12, please insert -- Co-transfection of 293T cells with pLTBP- 
3fl and pTGF-pl was followed by immunoprecipitation of LTBP-3 and TGF-pi produced by 



293T cells following transient transfection and radiolabeling. Aliquots (~10 6 incorporated CPM) 
of radiolabeled media were irnmunoprecipitated and separated using 4%-18% gradient SDS- 
PAGE and either reducing or nonreducing conditions as described (Yin et al., 1995). Cold 
standards were used to estimate molecular weights (200, 97.4, 69, 46, 30, 21.5 and 14.3 kDa; 
Rainbow mix, Amersham). The immunoprecipitation was followed by: 1, 293T cell medium 
proteins immunoprecipitated with 274 antibody (separated under reducing conditions) after co- 
transfection with pLTBP-3fl and pTGF-pl; 2,293T cell medium proteins immunoprecipitated 
with 274 antibody (separated under nonreducing conditions) after co-transfection with pLTBP- 
3fl and pTGF-pl; 3, untransfected 293T cell medium proteins immunoprecipitated with 274 
antibody (separated under reducing conditions); 4,293T cell medium proteins 
immunoprecipitated with 274 antibody (separated under reducing conditions) after transfection 
with pLTBP-3fl; 5, 293T cell medium proteins immunoprecipitated with 40091 antibody 
(separated under reducing conditions) after transfection with pTGF-pi; 6,293T cell medium 
proteins immunoprecipitated with 40091 antibody (separated under nonreducing conditions) after 
transfection with pTGF-pl; and Lane 7, 293T cell medium proteins immunoprecipitated with 
40091 antibody (separated under reducing conditions) after co-transfection with pLTBP-3fl and 
pTGF-pl . The signal was weakest in lanes in which proteins were immunoprecipitated using the 
40091 antibody, reflecting the weaker affinity of the 40091 antiserum. -. 

At pages 92-118, please delete the entire sequence listing. Please re-number the 
following pages (claims and abstract) consecutively. 

After page 127 (abstract), please insert the correct sequence listing pages submitted 
August 24, 1999 in the parent application (see accompanying Request at Box 13) and number 
these pages separately beginning with page 1. 



In the Claims: 

The accompanying paper requests cancellation of all pending claims except claim 40, 
without prejudice or disclaimer. 

Please amend claim 40 as follows: 
40. (Amended) A method for [identifying] binding a transforming growth factor p (TGF-P) 
protein in a sample, comprising contacting said sample with [an] a purified mammalian LTBP-2 
or LTBP-3 protein or polypeptide under conditions effective to allow binding [and detecting the 
protein so bound] of said LTBP-2 or LTBP-3 protein or polypeptide to said TGF-B protein: 
wherein said LTBP-2 or LTBP-3 protein or polypeptide comprises at least fifteen contiguous 
amino acids present in SEP ID NO:2 or SEQ ID NO:4, respectively . 

Please add new claims 43-78, as follows: 

43. The method of claim 40, wherein said sample is located within an animal and said 
LTBP-2 or LTBP-3 protein or polypeptide is administered to said animal in an amount effective 
to bind TGF-p in said animal. 

44. A method of binding TGF-p, comprising contacting a composition comprising TGF-p 
with a composition comprising a purified mammalian LTBP-2 or LTBP-3 protein or polypeptide 
in an amount effective to bind TGF-P; wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least fifteen contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 
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45. The method of claim 44, wherein said composition comprising TGF-p is located within 
an animal and said composition comprising said LTBP-2 or LTBP-3 protein or polypeptide is 
administered to said animal in an amount effective to bind TGF-p in said animal. 

46. A method of using an LTBP-2 or LTBP-3 protein, polypeptide or peptide, comprising 
providing to an animal a biologically effective amount of a purified mammalian LTBP-2 or 
LTBP-3 protein, polypeptide or peptide that comprises at least fifteen contiguous amino acids 
present in SEQ ID NO:2 or SEQ ID NO:4, respectively. 

47. The method of claim 46, wherein an amount of an LTBP-2 or LTBP-3 protein, 
polypeptide or peptide effective to generate anti-LTBP-2 or anti-LTBP-3 antibodies is provided 
to said animal. 

48. The method of claim 47, wherein an LTBP-2 or LTBP-3 peptide of between 15 and about 
50 amino acids in length is provided to said animal. 

49. The method of claim 47, wherein an LTBP-2 or LTBP-3 peptide of between 15 and about 
30 amino acids in length is provided to said animal. 

50. The method of claim 47, wherein antisera comprising said anti-LTBP-2 or anti-LTBP-3 
antibodies is collected from said animal. 



51. The method of claim 46, wherein an amount of an LTBP-2 or LTBP-3 protein or 
polypeptide effective to bind TGF-p is provided to said animal. 

52. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-P regulates TGF-p 
activity in said animal. 

53. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-p modulates the 
activation of TGF-p in said animal. 

54. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-p modulates the 
activation of latent complexes that comprise TGF-p, thereby regulating TGF-p activity. 

55 . The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-P targets TGF-p to 
the extracellular matrix in said animal. 

56. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-p targets TGF-p to 
the bone matrix in said animal. 

57. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-p targets TGF-p to 
connective tissues in said animal. 



58. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-p targets TGF-p to 
the cell surface of cells in said animal. 

59. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-P protects TGF-p 
from proteolytic attack and activation in said animal. 

60. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-p protects TGF-P 
from proteolytic attack and activation during wound repair or tissue healing in said animal. 

61. The method of claim 51, wherein said LTBP-2 or LTBP-3 protein or polypeptide is a 
recombinant protein or polypeptide prepared by expressing an LTBP-2-encoding or LTBP-3- 
encoding DNA segment in a recombinant host cell and purifying the expressed LTBP-2 or 
LTBP-3 protein or polypeptide away from total recombinant host cell components. 

62. The method of claim 51, wherein said TGF-P is located within a tissue healing, wound 
repair tissue site or bone progenitor tissue site of said animal and wherein said LTBP-2 or 
LTBP-3 protein or polypeptide is provided to said tissue site. 

63. The method of claim 62, wherein said TGF-P is located within a tissue healing or wound 
repair tissue site of said animal 

64. The method of claim 62, wherein said TGF-p is located within a bone progenitor tissue 
site of said animal. 



65. The method of claim 62, wherein said LTBP-2 or LTBP-3 protein or polypeptide is 
provided to said tissue site by contacting said tissue site with a composition comprising a nucleic 
acid segment that expresses said LTBP-2 or LTBP-3 protein or polypeptide in cells of said tissue 
site. 

66. The method of claim 65, wherein said LTBP-2 or LTBP-3 protein or polypeptide is 
provided to said tissue site by contacting said tissue site with a composition comprising said 
nucleic acid segment and a structural biocompatible matrix. 

67. The method of claim 65, wherein said nucleic acid segment is a DNA segment. 

68. The method of claim 65, wherein said nucleic acid segment is an RNA segment. 

69. The method of claim 51, wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least about thirty contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 

70. The method of claim 51, wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least about fifty contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 
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71 . The method of claim 5 1 , wherein said LTBP-2 or LTBP-3 protein or polypeptide exhibits 
at least 90% identity to the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 

72. The method of claim 5 1 , wherein said LTBP-2 or LTBP-3 protein or polypeptide exhibits 
between 91% and about 99% identity to the amino acid sequence set forth in SEQ ID NO:2 or 
SEQ ID NO:4, respectively. 

73. The method of claim 51, wherein an LTBP-2 protein comprising the amino acid sequence 
of SEQ ID NO:2 is provided to said animal. 

74. The method of claim 5 1 , wherein an LTBP-3 protein comprising the amino acid sequence 
of SEQ ID NO:4 is provided to said animal. 

75. A method of using an LTBP-2 or LTBP-3 protein or polypeptide, comprising 
administering to an animal a purified mammalian LTBP-2 or LTBP-3 protein or polypeptide in 
an amount effective to bind TGF-J3 in said animal; wherein said LTBP-2 or LTBP-3 protein or 
polypeptide specifically binds TGF-J3 and comprises at least fifteen contiguous amino acids 
present in SEQ ID NO:2 or SEQ ID NO:4, respectively. 

76. A method of binding TGF-0 within a repair or bone progenitor tissue site of an animal, 
comprising contacting said tissue site with a purified mammalian LTBP-2 or LTBP-3 protein or 
polypeptide, or a nucleic acid that expresses said LTBP-2 or LTBP-3 protein or polypeptide, to 



provide an amount of said LTBP-2 or LTBP-3 protein or polypeptide effective to bind TGF-p in 
said animal; wherein said LTBP-2 or LTBP-3 protein or polypeptide comprises at least fifteen 
contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4 ? respectively. 

77. A method of binding TGF-P, comprising administering to an animal a composition 
comprising a purified mammalian LTBP-2 or LTBP-3 protein or polypeptide in an amount 
effective to bind TGF-p in said animal; wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least fifteen contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 

78. A method of binding TGF-p , comprising administering to an animal a composition 
comprising a purified mammalian LTBP-3 protein or polypeptide in an amount effective to bind 
TGF-P in said animal; wherein said LTBP-3 protein or polypeptide binds TGF-p and comprises 
at least fifteen contiguous amino acids present in SEQ ID NO:4 or exhibits at least 90% identity 
to the amino acid sequence set forth in SEQ ID NO:4. 

REMARKS 

I. Status of the Claims 

The accompanying paper requests cancellation of all pending claims except claim 40 ? 
without prejudice or disclaimer. Presently, claim 40 is being amended to even further improve 
its clarity. New claims 43-78 have been added, which are supported by the original specification 
and parent applications incorporated therein by reference (see below). Numbering the present 
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claims starting with claim 43 is believed to be correct as the copy of the specification enclosed 
concludes with claim 42. 

Claims 40 and 43-783 are therefore in the case. For the convenience of the Examiner, a 
copy of the pending claims is attached hereto as Exhibit A. 

II* Continuing Application Status 

The present application is a continuing application based upon allowed application Serial 
No. 08/479,722 ("the 722 application; Attorney Docket Nos. 4100.000500, UMIC:013). The 
inventorship remains the same as the earlier application. 

A copy of the claims that will issue from the 722 application is enclosed as Exhibit B. 
The claims canceled from the 722 application in light of a restriction requirement are also 
provided herewith as Exhibit C 

III. Other Parent Applications 

The parent, 722 application is a continuation-in-part of PCT Application 
PCT/US95/02251, filed February 21, 1995 (Attorney Docket Nos. 4100.000410, UMIC:009P); 
which is a continuation-in-part of U.S. Application Serial Number 08/316,650 ("the ! 650 
application; Attorney Docket Nos. 4100.000300, UMIC:003), filed September 30, 1994; which is 
a continuation-in-part of U.S. Application Serial Number 08/199,780 ("the 780 application; 
Attorney Docket Nos. 4100.000200, UMIC:002), filed February 18, 1994. The f 650 application 
matured into U.S. Patent No. 5,942,496, and the 780 application earlier matured into U.S. Patent 
No. 5,763,416. 
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The entire text and figures of each of the '650 and 780 applications were specifically 
incorporated by reference into the 722 application on filing (see 722 application at page 2, 
opening paragraph). 

A copy of the claims to issue from the '650 application is enclosed as Exhibit D. The 
claims that issued from the 780 application are provided as Exhibit E, and the original claims in 
the 780 application are shown in Exhibit F. 

IV, Amendments to the Specification 

As the grandparent ( f 650) and great grandparent (780) applications were specifically 
incorporated by reference into the immediate parent (722) application (see page 2), the text of 
the T 650 and 780 applications forms part of the present disclosure. An amendment is presently 
being introduced into the instant specification from those earlier applications, to which priority is 
still claimed. This addition is fully supported by the original '650 and 780 applications and, for 
convenience of review, may be correlated with the issued claims in those cases attached as 
Exhibit D and Exhibit E. 

Incorporation of material into the specification by reference is proper. Ex parte Schwarze, 
151 USPQ 426 (Bd. App. 1966). Incorporation by reference of one or more issued U.S. patents, at 
the time of filing the application, also provides the enabling support required by 35 U.S.C § 112, 
first paragraph, for any claims advanced in the new application that rely on the same standard of 
teaching of "how to make and use" as the claims in the issued patent(s). 

Other issued U.S. patents that concern the use of exogenous growth factors in methods 
for treating bone defects or inducing bone formation in vivo are also incorporated by reference 
into the 722 specification. For example, U.S. Patent No. 4,877,864, which concerns the 
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administration of a therapeutic composition of bone inductive protein to treat cartilage and/or 
bone defects, and U.S. Patent 5,108,753, which concerns the use of a device containing a pure 
osteogenic protein to induce endochondral bone formation and for use in periodontal, dental or 
craniofacial reconstructive procedures (722 specification at pages 30 and 83). 

Upon agreement concerning allowable subject matter, Applicants intend to amend the 
'Summary of Invention' section to add the precise language of the allowed claims. In the interest 
of efficiency for both the Applicants and the Office, it seems most appropriate to defer 
amendment of the Summary until agreement on allowability. 

Certain amendments to the specification are also being made to conform the specification 
to the formal drawings enclosed herewith. These amendments, each of which were entered 
during prosecution of the 722 application, are fully supported by the original text and drawings. 

V. Support for the Claims 

Support for the present claims is to be found throughout the original parent (722) 
application and in the grandparent ('650) and great grandparent (780) applications, each 
specifically incorporated into the 722 application by reference and now progressed to issue. 

In addition to finding support throughout the present specification, the language of the 
current claims matches that found to be acceptable in the parent applications, including the 
definitions of the proteins, polypeptides, peptides and nucleic acids from the 722 application and 
the tissue site and gene-matrix embodiments from the f 650 and 780 applications. For the 
convenience of the Examiner, copies of the allowed claims from each of the three parent 
applications are attached hereto as Exhibits B 5 D and E. Additional support for the present 
claims exists in the specification and parent applications as follows. 
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Claim 40 is based upon the original claim (Exhibit C), although "identifying" has been 
replaced with "binding". Claim 44 is also based upon a version of claim 40 that includes the 
"binding" terminology. "Binding TGFp" is evidently an inherent property of latent TGFp 
binding proteins (LTBPs) and is supported by the entire 722 specification (e.g., see page 47, 
line 14 and page 68, line 5). 

The LTBP-2 and LTBP-3 proteins and polypeptides in all independent claims are defined 
as "purified, mammalian" proteins or polypeptides, as in the claims allowed in the parent 
application (Exhibit B). 

In independent claims 40, 44, 46, 75, 76 and 77, the language comprising "at least fifteen 
contiguous amino acids present in SEQID NO:4", as allowed in the parent application in 
reference to LTBP-3 (Exhibit B), has also been adapted for LTBP-2. In independent claim 78, 
which refers to LTBP-3 alone, the protein or polypeptide is defined in the alternative as either 
comprising at least fifteen contiguous amino acids from SEQ ID NO:4 or exhibiting at least 90% 
identity to the amino acid sequence of SEQ ID NO:4. These definitions were allowed in 
claims 127 and 145 of the 722 application (Exhibit B). 

Dependent claims 43 and 45 qualify independent claims 40 and 44 by reciting that the 
LTBP--2 or LTBP-3 proteins or polypeptides are administered to an animal to bind TGF-P* These 
claims are supported throughout the 722 application, e.g., at least at page 29, line 10 through 
page 30, line 28; page 44, line 30 through page 48, line 8; page 57, line 17 through page 58, 
line 29; and at page 71, line 7 through page 74, line 2. 

Similarly, independent claim 46 recites methods of use in which LTBP-2 or LTBP-3 
proteins, polypeptides or peptides are administered to an animal, and is supported by the 722 
application at least at page 2, lines 12-14; page 29, line 10 through page 30, line 28, as 
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exemplified by page 29, line 11 and page 30, lines 22-28; page 41, line 19 through page 42, 
line 4; page 44, line 30 through page 48, line 8; page 57, line 17 through page 58, line 29; 
page 71, line 7 through page 74, line 2; and at page 127, line 4. 

Claims 47 and 50 concern the administration of LTBP-2 or LTBP-3 proteins, 
polypeptides or peptides to generate anti-LTBP-2 or anti-LTBP-3 antibodies and antisera that 
may be collected from an animal. These are supported by Section 8 of the Detailed Description 
and in Example II of the 722 application, see, e.g. , page 40, lines 25-27. The exemplary peptides 
of between 15 and about 50 or 30 amino acids in length, as defined in claims 48 and 49, are 
supported by allowed claims 130 and 131 in the '722 application (Exhibit B). 

Dependent claims 51 through 60 recite various functional outcomes of LTBP binding to 
TGF-p, and are supported throughout the 722 application, notably by Sections 9 and 10 of the 
Detailed Description and in Example I. Basically, as TGF-p has the ability to regulate tissue 
remodeling and wound repair (see pages 2-8, 46, 47 and 57), and as LTBPs bind TGF-p 
(Example I), LTBPs are useful in the regulation of tissue remodeling and wound repair. 

The production and storage of TGF-p as a latent complex that is activated only under 
certain physiological (or pathological) conditions provides for the precise regulation of TGF-(3 
(pages 4, 6, 8, 47, 57, 71 and 73). LTBPs both regulate and target TGF-p activity (pages 45 
and 71) and help ensure TGF-p integrity in vivo (page 46). By protecting TGF-p from 
proteolytic attack under pathological conditions, LTBPs will moderate aberrant effects, such as 
reducing scar tissue during wound healing. LTBPs thus contribute to TGF-p regulation through 
a sophisticated feedback loop (pages 71 and 73). 

For particular written description support for claims 52 through 60, see page 45, line 23 
(claim 52; regulates TGF-p activity); page 46, line 5 and page 58, lines 22-27 (claims 53 and 54; 
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modulates the activation of latent complexes that comprise TGF-p); page 46, line 1 and page 58, 
lines 19-22 (claim 55; targets TGF-p to the extracellular matrix); page 48, lines 4-5 (claim 56; 
targets TGF-p to the bone matrix); page 45, line 29 and page 58, line 18 (claim 57; targets 
TGF-p to connective tissues); page 46, line 2 and page 58, lines 27-29 (claim 58; targets TGF-p 
to the cell surface); and page 3, lines 14, 15, 26 and 27; page 4, line 14; page 6, line 29; page 28, 
line 9; page 46, lines 7-11 and page 58, lines 27-29 (claims 59 and 60; protects TGF-P from 
proteolytic attack and activation, as in wound repair and tissue healing). 

Claim 61 defines the LTBP-2 or LTBP-3 proteins or polypeptides as recombinant 
proteins or polypeptides and is based upon original claim 22 (Exhibit C). 

Each of claims 62-64 define the location of the TGF-p within the animal as a tissue 
healing, wound repair tissue site or bone progenitor tissue site. These are again supported 
throughout the 722 application, notably by Sections 1-4, 9 and 10 of the Detailed Description 
and Example I (see also pages 2-8, 22, 27, 28, 46, 47, 57 and 58, particularly page 3, lines 26-27; 
page 4, line 14; page 6, line 29; page 28, line 9; page 46, lines 8-11; page 47, lines 19-22; and 
page 48, line 1). The "repair" and "bone progenitor" tissue sites also correlate with the language 
of the gene-matrix method claims allowed in the grandparent ('650) and great grandparent (780) 
applications (Exhibit D and Exhibit E). 

Claims 65-68 concern the use of nucleic acids (claim 65) and nucleic acid-structural 
matrix compositions (claim 66) to provide the LTBP-2 or LTBP-3 protein or polypeptide to 
TGF-p in a repair or bone progenitor tissue site of an animal Both DNA (claim 67) and RNA 
(claim 68) nucleic acids may be employed (see also claims 161 and 162 of the 722 application, 
Exhibit B). RNA may be preferred in certain embodiments, particularly as traversal of the 
nuclear envelope is not necessary for expression. 
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The use of nucleic acids and nucleic acids in functional association with structural 
matrices are predominantly supported by the '650 and 780 applications specifically incorporated 
by reference into the 722 application (see Exhibits D, E and F). Accordingly, such language is 
being introduced into the present application by amendment at a convenient point of the detailed 
description. Incorporation of material into the specification by reference is proper. Ex parte 
Schwarze, supra. 

Exemplary LTBP-2 and LTBP-3 proteins and polypeptides for use in the claimed 
invention are described in claims 69 through 74. These claims are based upon those allowed in 
the 722 application (see claims 128, 129, 138 and 144-146 of Exhibit B). 

Claims 75 through 78 are variations of earlier claims presented in independent form. 
Support for these claims is as described above, e.g., for claim 75, see claims 46, 43, 45 and 51; 
for claim 76, see claims 51, 43, 45 and 62-64; for claim 77, see claim 51; and for claim 78, which 
refers to LTBP-3 only, see claims 51 and 71. 

It will therefore be understood that no new matter is included within the present claims. 

VI. Formalities 

Applicants' representative, Shelley Fussey, has changed law firms since the parent 
application was filed. A new Power of Attorney was submitted in the parent application and an 
additional copy is enclosed herewith. All communications should be directed to the address 
listed therein and at the end of this document. 

Formal drawings are included herewith. 

No fees should be due in addition to the enclosed filing fees. However, should any 
additional fees under 37 CJF.R, §§ 1.16 to 1.21 be required for any reason, the Assistant 
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Commissioner is authorized to deduct said fees from Williams, Morgan & Amerson, P.C. 
Deposit Account No. 50-0786/4100.000582. 

VII. Conclusion 

The present claims are believed to be condition for allowance, and an early indication to 
this effect is respectfully requested. Should Examiner Fitzgerald have any questions or 
comments, a telephone call to the undersigned Applicants representative is earnestly solicited. 



WILLIAMS, MORGAN & AMERSON, P.C. 
7676 Hillmont, Suite 250 
Houston, Texas, 77040 
(713) 934-4079 

Date: June 12, 2000 



Respectfully submitted, 



Shelley P.M. Fussey 
Reg. No. 39,458 
Agent for Applicants 
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EXHIBIT A 
Pending Claims In Continuation 



40. (Amended) A method for [identifying] binding a transforming growth factor p (TGF-p) 
protein in a sample, comprising contacting said sample with [an] a purified mammalian LTBP-2 
or LTBP-3 protein or polypeptide under conditions effective to allow binding [and detecting the 
protein so bound] of said LTBP-2 or LTBP-3 protein or polypeptide to said TGF-B protein; 
wherein said LTBP-2 or LTBP-3 protein or polypeptide comprises at least fifteen contiguous 
amino acids present in SEP ID NO:2 or SEP ID NO:4. respectively . 

43. The method of claim 40, wherein said sample is located within an animal and said 
LTBP-2 or LTBP-3 protein or polypeptide is administered to said animal in an amount effective 
to bind TGF-p in said animal. 

44. A method of binding TGF-p, comprising contacting a composition comprising TGF-P 
with a composition comprising a purified mammalian LTBP-2 or LTBP-3 protein or polypeptide 
in an amount effective to bind TGF-p; wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least fifteen contiguous amino acids present in SEQ ID NG:2 or SEQ ID NG:4, 
respectively. 

45. The method of claim 44, wherein said composition comprising TGF-p is located within 
an animal and said composition comprising said LTBP-2 or LTBP-3 protein or polypeptide is 
administered to said animal in an amount effective to bind TGF-p in said animal. 

46. A method of using an LTBP-2 or LTBP-3 protein, polypeptide or peptide, comprising 
providing to an animal a biologically effective amount of a purified mammalian LTBP-2 or 
LTBP-3 protein, polypeptide or peptide that comprises at least fifteen contiguous amino acids 
present in SEQ ID NG:2 or SEQ ID NP:4, respectively. 

47. The method of claim 46, wherein an amount of an LTBP-2 or LTBP-3 protein, 
polypeptide or peptide effective to generate anti-LTBP-2 or anti-LTBP-3 antibodies is provided 
to said animal. 

48. The method of claim 47, wherein an LTBP-2 or LTBP-3 peptide of between 15 and about 
50 amino acids in length is provided to said animal. 

49. The method of claim 47, wherein an LTBP-2 or LTBP-3 peptide of between 1 5 and about 
30 amino acids in length is provided to said animal. 



50. The method of claim 47, wherein antisera comprising said anti-LTBP-2 or anti-LTBP-3 
antibodies is collected from said animal. 

51. The method of claim 46, wherein an amount of an LTBP-2 or LTBP-3 protein or 
polypeptide effective to bind TGF-p is provided to said animal. 

52. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-P regulates TGF-P 
activity in said animal. 

53. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-p modulates the 
activation of TGF-P in said animal. 

54. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-P modulates the 
activation of latent complexes that comprise TGF-P, thereby regulating TGF-P activity. 

55. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-P targets TGF-p to 
the extracellular matrix in said animal. 

56. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-p targets TGF-p to 
the bone matrix in said animal. 

57. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-p targets TGF-p to 
connective tissues in said animal. 

58. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-p targets TGF-p to 
the cell surface of cells in said animal. 

59. The method of claim 51, wherein LTBP-2 or LTBP-3 binding to TGF-p protects TGF-p 
from proteolytic attack and activation in said animal. 

60. The method of claim 5 1 , wherein LTBP-2 or LTBP-3 binding to TGF-p protects TGF-P 
from proteolytic attack and activation during wound repair or tissue healing in said animal. 



6L The method of claim 51 , wherein said LTBP-2 or LTBP-3 protein or polypeptide is a 
recombinant protein or polypeptide prepared by expressing an LTBP-2-encoding or LTBP-3 - 
encoding DNA segment in a recombinant host cell and purifying the expressed LTBP-2 or 
LTBP-3 protein or polypeptide away from total recombinant host cell components. 

62. The method of claim 51, wherein said TGF-p is located within a tissue healing, wound 
repair tissue site or bone progenitor tissue site of said animal and wherein said LTBP-2 or 
LTBP-3 protein or polypeptide is provided to said tissue site. 

63. The method of claim 62, wherein said TGF-P is located within a tissue healing or wound 
repair tissue site of said animal. 

64. The method of claim 62, wherein said TGF-p is located within a bone progenitor tissue 
site of said animal. 

65. The method of claim 62, wherein said LTBP-2 or LTBP-3 protein or polypeptide is 
provided to said tissue site by contacting said tissue site with a composition comprising a nucleic 
acid segment that expresses said LTBP-2 or LTBP-3 protein or polypeptide in cells of said tissue 
site. 

66. The method of claim 65, wherein said LTBP-2 or LTBP-3 protein or polypeptide is 
provided to said tissue site by contacting said tissue site with a composition comprising said 
nucleic acid segment and a structural biocompatible matrix. 

67. The method of claim 65, wherein said nucleic acid segment is a DNA segment. 

68. The method of claim 65, wherein said nucleic acid segment is an RNA segment. 

69. The method of claim 51, wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least about thirty contiguous amino acids present in SEQ ID NO:2 or SEQ ID 
NO:4, respectively. 

70. The method of claim 51, wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least about fifty contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 



71. The method of claim 51, wherein said LTBP-2 or LTBP-3 protein or polypeptide exhibits 
at least 90% identity to the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 

72. The method of claim 5 1 , wherein said LTBP-2 or LTBP-3 protein or polypeptide exhibits 
between 91% and about 99% identity to the amino acid sequence set forth in SEQ ID NO:2 or 
SEQ ID NO:4, respectively. 

73 . The method of claim 5 1 , wherein an LTBP-2 protein comprising the amino acid sequence 
of SEQ ID NO:2 is provided to said animal. 

74. The method of claim 5 1 , wherein an LTBP-3 protein comprising the amino acid sequence 
of SEQ ID NO:4 is provided to said animal. 

75. A method of using an LTBP-2 or LTBP-3 protein or polypeptide, comprising 
administering to an animal a purified mammalian LTBP-2 or LTBP-3 protein or polypeptide in 
an amount effective to bind TGF-p in said animal; wherein said LTBP-2 or LTBP-3 protein or 
polypeptide specifically binds TGF-p and comprises at least fifteen contiguous amino acids 
present in SEQ ID NO:2 or SEQ ID NO:4, respectively. 

76. A method of binding TGF-p within a repair or bone progenitor tissue site of an animal, 
comprising contacting said tissue site with a purified mammalian LTBP-2 or LTBP-3 protein or 
polypeptide, or a nucleic acid that expresses said LTBP-2 or LTBP-3 protein or polypeptide, to 
provide an amount of said LTBP-2 or LTBP-3 protein or polypeptide effective to bind TGF-p in 
said animal; wherein said LTBP-2 or LTBP-3 protein or polypeptide comprises at least fifteen 
contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4, respectively. 

77. A method of binding TGF-p, comprising administering to an animal a composition 
comprising a purified mammalian LTBP-2 or LTBP-3 protein or polypeptide in an amount 
effective to bind TGF-p in said animal; wherein said LTBP-2 or LTBP-3 protein or polypeptide 
comprises at least fifteen contiguous amino acids present in SEQ ID NO:2 or SEQ ID NO:4, 
respectively. 

78. A method of binding TGF-P, comprising administering to an animal a composition 
comprising a purified mammalian LTBP-3 protein or polypeptide in an amount effective to bind 
TGF-p in said animal; wherein said LTBP-3 protein or polypeptide binds TGF-p and comprises 
at least fifteen contiguous amino acids present in SEQ ID NO:4 or exhibits at least 90% identity 
to the amino acid sequence set forth in SEQ ID NO:4. 



EXHIBIT B 
Allowed Claims in Parent, Serial No. 08/479,722 



127. An isolated nucleic acid molecule comprising a sequence encoding at least fifteen 
contiguous amino acids present in SEQ ID NO:4. 



128. The isolated nucleic acid molecule of claim 127, comprising a sequence that encodes at 
least about thirty contiguous amino acids present in SEQ ID NO:4. 



129. The isolated nucleic acid molecule of claim 127, comprising a sequence that encodes at 
least about fifty contiguous amino acids present in SEQ ID NO:4. 



130. The isolated nucleic acid molecule of claim 127, comprising a sequence which, upon 
translation, affords a peptide of between 15 and about 50 amino acids in length. 



131. The isolated nucleic acid molecule of claim 127, comprising a sequence which, upon 
translation, affords a peptide of between 15 and about 30 amino acids in length. 



132. The isolated nucleic acid molecule of claim 127, comprising a sequence that encodes a 
polypeptide or peptide in which said at least fifteen contiguous amino acids present in SEQ ID 
NO:4 are present in domain 1, domain 2, domain 3, domain 4 or domain 5 of the LTBP-3 
polypeptide sequence set forth in SEQ ID NO:4. 

133. The isolated nucleic acid molecule of claim 132, comprising a sequence that encodes a 
polypeptide or peptide in which said at least fifteen contiguous amino acids present in SEQ ID 
NO:4 are present in domain 1 of the LTBP-3 polypeptide sequence set forth in SEQ ID NO:4. 

134. The isolated nucleic acid molecule of claim 132, comprising a sequence that encodes a 
polypeptide or peptide in which said at least fifteen contiguous amino acids present in SEQ ID 
NO:4 are present in domain 2 of the LTBP-3 polypeptide sequence set forth in SEQ ID NO:4. 

135. The isolated nucleic acid molecule of claim 132, comprising a sequence that encodes a 
polypeptide or peptide in which said at least fifteen contiguous amino acids present in SEQ ID 
NO:4 are present in domain 3 of the LTBP-3 polypeptide sequence set forth in SEQ ID NO:4. 



l 



136. The isolated nucleic acid molecule of claim 132, comprising a sequence that encodes a 
polypeptide or peptide in which said at least fifteen contiguous amino acids present in SEQ ID 
NO:4 are present in domain 4 of the LTBP-3 polypeptide sequence set forth in SEQ ID NO:4. 

137. The isolated nucleic acid molecule of claim 132, comprising a sequence that encodes a 
polypeptide or peptide in which said at least fifteen contiguous amino acids present in SEQ ID 
NO:4 are present in domain 5 of the LTBP-3 polypeptide sequence set forth in SEQ ID NO:4. 



138. The isolated nucleic acid molecule of claim 127, comprising a sequence that encodes a 
polypeptide comprising about 1,251 contiguous amino acids in SEQ ID NO:4. 



140. (Amended) The isolated nucleic acid molecule of claim 138, comprising the contiguous 
nucleic acid sequence of SEQ ID NO:3. 



141. The isolated nucleic acid molecule of claim 127, further comprising a recombinant 
promoter. 



142. The isolated nucleic acid molecule of claim 141, wherein said sequence is positioned, in 
reverse orientation, under the control of a promoter that directs the expression of an antisense 
product. 



143. The isolated nucleic acid molecule of claim 127, further defined as a recombinant vector. 



144. An isolated nucleic acid molecule comprising a sequence that encodes a protein having 
the contiguous amino acid sequence of SEQ ID NO:4. 



145. An isolated nucleic acid molecule encoding a polypeptide that exhibits at least 90% 
identity to the amino acid sequence set forth in SEQ ID NO:4, wherein said polypeptide 
specifically binds to TGF-pi. 



146. The isolated nucleic acid molecule of claim 145, wherein the encoded polypeptide 
exhibits between 91% and about 99% identity to the amino acid sequence set forth in SEQ ID 
NO:4. 



147. An isolated nucleic acid molecule encoding a mammalian LTBP-3 polypeptide, wherein 
said nucleic acid molecule comprises the nucleotide sequence of: 

(a) the coding sequence of a cDNA molecule present in a mammalian library, 
wherein the cDNA molecule hybridizes with a probe having the sequence of the 
complement of SEQ ID NO: 3 under conditions of high stringency; or 

(b) a nucleotide sequence degenerate with a sequence according to (a). 

148. The isolated nucleic acid molecule of claim 147, wherein said nucleic acid molecule 
comprises the nucleotide sequence of the coding sequence of a cDNA molecule present in a 
mammalian library, wherein the cDNA molecule hybridizes with a probe having the sequence of 
the complement of SEQ ID NO:3 under conditions of high stringency. 

149. The isolated nucleic acid molecule of claim 147, wherein said nucleic acid molecule 
comprises a nucleotide sequence degenerate with the coding sequence of a cDNA molecule 
present in a mammalian library, wherein the cDNA molecule hybridizes with a probe having the 
sequence of the complement of SEQ ID NO:3 under conditions of high stringency. 

152. (Amended) The isolated nucleic acid molecule of claim 147, wherein the nucleic acid 
molecule comprises a nucleotide sequence of at least 50 contiguous nucleotides present in SEQ 
ID NO:3. 



153. The isolated nucleic acid molecule of claim 152, wherein the nucleic acid molecule 
comprises a nucleotide sequence of at least 100 contiguous nucleotides present in SEQ ID NO:3. 

154. The isolated nucleic acid molecule of claim 153, wherein the nucleic acid molecule 
comprises a nucleotide sequence of at least 200 contiguous nucleotides present in SEQ ID NO:3. 

155. The isolated nucleic acid molecule of claim 154, wherein the nucleic acid molecule 
comprises a nucleotide sequence of at least 500 contiguous nucleotides present in SEQ ID NO:3. 

156. The isolated nucleic acid molecule of claim 155, wherein the nucleic acid molecule 
comprises a nucleotide sequence of at least 1000 contiguous nucleotides present in SEQ ID 
NO:3. 



157. The isolated nucleic acid molecule of claim 156, wherein the nucleic acid molecule 
comprises a nucleotide sequence of at least 3000 contiguous nucleotides present in SEQ ID 
NO:3. 



159. The isolated nucleic acid molecule of claim 147, wherein the nucleic acid molecule is up 
to about 10,000 basepairs in length. 



160. The isolated nucleic acid molecule of claim 159, wherein the nucleic acid molecule is up 
to about 5,000 basepairs in length. 



161. The isolated nucleic acid molecule of claim 147, further defined as a DNA molecule. 



162. The isolated nucleic acid molecule of claim 147, further defined as an RNA molecule. 



163. An isolated nucleic acid molecule encoding a mammalian LTBP-3 polypeptide, wherein 
said nucleic acid molecule comprises the nucleotide sequence of 

(a) the coding sequence of a cDNA molecule present in a mammalian library, 
wherein the cDNA molecule hybridizes, under conditions of high stringency, with 
a probe having the sequence of the complement of the isolated LTBP-3 sequence 
region within the biological material deposited as ATCC 209496; or 

(b) a nucleotide sequence degenerate with a sequence according to (a). 

164. (Amended) The isolated nucleic acid molecule of claim 163, wherein said nucleic acid 
molecule [encodes a mammalian LTBP-3 polypeptide that includes at least fifteen contiguous 
amino acids encoded by] comprises a sequence of at least 50 contiguous nucleotides present in 
the sequence of the isolated LTBP-3 sequence region within the biological material deposited as 
ATCC 209496. 



165. The isolated nucleic acid molecule of claim 164, wherein said nucleic acid molecule 
encodes a mammalian LTBP-3 polypeptide that comprises about 1,251 contiguous amino acids 
encoded by the isolated LTBP-3 sequence region within the biological material deposited as 
ATCC 209496. 
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166. The isolated nucleic acid molecule of claim 165, wherein said nucleic acid molecule has 
the nucleotide sequence of the isolated LTBP-3 sequence region within the biological material 
deposited as ATCC 209496. 

167. (Amended) An isolated nucleic acid molecule encoding a mammalian LTBP-3 
polypeptide, wherein said nucleic acid molecule comprises the nucleotide sequence of 

the coding sequence of a cDNA molecule present in a mammalian library, 
wherein the cDNA molecule hybridizes, under conditions of high stringency, with 
a probe having the sequence of the complement of the isolated LTBP-3 sequence 
region within the vector pLTBP-3fl, deposited as ATCC 209496. 

168. The isolated nucleic acid molecule of claim 167, wherein said nucleic acid molecule 
encodes a mammalian LTBP-3 polypeptide that has the contiguous amino acid sequence encoded 
by the isolated LTBP-3 sequence region within the vector pLTBP-3fl. 

169. A recombinant host cell comprising an isolated nucleic acid molecule in accordance with 
claim 127, claim 144, claim 145, claim 147, claim 163 or claim 167. 



170. The recombinant host cell of claim 1 69, further defined as a prokaryotic host cell. 

171. The recombinant host cell of claim 1 69, further defined as a eukaryotic host cell. 

172. The recombinant host cell of claim 169, wherein the isolated nucleic acid molecule is 
introduced into the cell by means of a recombinant vector and the host cell expresses the isolated 
nucleic acid molecule to produce the encoded protein or peptide. 

173. (Amended) A method of using an isolated nucleic acid molecule that encodes a protein or 
peptide, the method comprising expressing, in a recombinant host cell, a recombinant vector that 
comprises an isolated nucleic acid molecule in accordance with claim 127, claim 144, claim 145, 
claim 147, claim 163 or claim 167, and collecting the expressed protein or peptide. 



EXHIBIT C 

Claims Restricted from Parent, Serial No. 08/479,722 



Group III: 

22. A recombinant LTBP-2 or LTBP-3 protein or peptide prepared by expressing an 
LTBP-2- or LTBP-3 -encoding DNA segment in a recombinant host cell and purifying the 
expressed LTBP-2- or LTBP-3 protein or peptide away from total recombinant host cell 
components. 

42. A composition comprising a purified murine LTBP-2 or LTBP-3 polypeptide. 
Part of Group IV 

23. A method for detecting an LTBP-2 or LTBP-3 nucleic acid segment in a sample, 
comprising the steps of: 

(a) obtaining sample nucleic acids from a sample suspected of containing an 
LTBP-2 or LTBP-3 nucleic acid segment; 

(b) contacting said sample nucleic acids with an isolated LTBP-2 or LTBP-3 
nucleic acid segment under conditions effective to allow hybridization of 
substantially complementary nucleic acids; and 

(c) detecting the hybridized complementary nucleic acids thus formed. 



Group V: 

37. A purified antibody that binds to an LTBP-3 protein or peptide. 

38. The antibody of claim 37, wherein the antibody is linked to a detectable label. 

39. An immunodetection kit comprising, in suitable container means, an LTBP-3 protein or 
peptide, or a first antibody that binds to an LTBP-3 protein or peptide, and an immunodetection 
reagent. 

Group VI: 

40. A method for identifying a transforming growth factor P protein in a sample, comprising 
contacting said sample with an LTBP-2 or LTBP-3 protein under conditions effective to allow 
binding and detecting the protein so bound. 



41. A method for purifying transforming growth factor p (TGFp) protein in a sample, 
comprising: 

(a) contacting a sample suspected of containing TGFp p protein with an 
LTBP-2 or LTBP-3 protein under conditions effective to allow specific 
binding; and 

(b) collecting the bound TGFp, substantially free from the non-bound 
components. 



EXHIBIT D 

Claims to Issue in Grandparent, Patent No. 5,942,496 



51. (Amended) A method for transferring nucleic acid segments into bone progenitor cells 
located within a bone progenitor tissue site of an animal, comprising contacting said tissue site with 
a composition comprising two or more nucleic acid segments and a structural bone-compatible 
matrix, so as to transfer said two or more nucleic acid segments into said cells, wherein said cells 
express transcriptional or translational products encoded by said nucleic acid segments. 

52. (Amended) The method of claim 51, comprising contacting bone progenitor cells with a 
composition comprising two nucleic acid segments and a structural bone-compatible matrix. 

53. (Amended) The method of claim 51, comprising contacting bone progenitor cells with a 
composition comprising three nucleic acid segments and a structural bone-compatible matrix. 

55. (Amended) The method of claim 51, wherein the contacting process comprises bringing 
said two or more nucleic acid segments into contact with said structural bone-compatible matrix to 
form a matrix-nucleic acid segments composition and bringing said matrix-nucleic acid segments 
composition into contact with said tissue site. 

56. (Amended) The method of claim 55, wherein said nucleic acid segments are absorbed in 
said structural bone-compatible matrix. 

57. (Amended) The method of claim 55, wherein said nucleic acid segments are adsorbed to 
said structural bone-compatible matrix. 

58. (Amended) The method of claim 55, wherein said nucleic acid segments are impregnated 
within said structural bone-compatible matrix. 

59. The method of claim 51, wherein said bone progenitor cells are stem cells, macrophages, 
fibroblasts, vascular cells, osteoblasts, chondroblasts or osteoclasts. 

60. The method of claim 59, wherein said bone progenitor cells are fibroblasts. 



6 1 . The method of claim 5 1 , wherein at least one of said two or more nucleic acid segments is a 
DNA molecule. 



62. (Amended) The method of claim 51, wherein at least one of said two or more nucleic acid 
segments is an antisense nucleic acid molecule. 

63. The method of claim 51, wherein at least one of said two or more nucleic acid segments is a 
linear nucleic acid molecule, a plasmid or a recombinant insert within the genome of a recombinant 
virus. 



64. (Amended) The method of claim 51, wherein at least one of said two or more nucleic acid 
segments encodes a polypeptide or protein that stimulates bone progenitor cells when expressed by 
said cells. 

65. (Amended) The method of claim 51, wherein said structural bone-compatible matrix is a 
collagenous, metal, hydroxylapatite, bioglass, aluminate, bioceramic, acrylic ester polymer, lactic 
acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

66. (Amended) The method of claim 65, wherein said structural bone-compatible matrix is a 
titanium matrix. 



67. (Amended) The method of claim 66, wherein said structural bone-compatible matrix is a 
titanium matrix coated with hydroxylapatite. 

68. (Amended) The method of claim 65, wherein said structural bone-compatible matrix is a 
collagen preparation. 

69. (Amended) The method of claim 68, wherein said structural bone-compatible matrix is a 
type II collagen preparation. 



70. (Amended) The method of claim 69, wherein said structural bone-compatible matrix is a 
recombinant type II collagen preparation. 



71. (Amended) The method of claim 69, wherein said structural bone-compatible matrix is a 
type II collagen preparation further supplemented with minerals. 



72. (Amended) The method of claim 65, wherein said structural bone-compatible matrix is a 
lactic acid polymer, glycolic acid polymer or lactic acid/gly colic acid polymer matrix. 

73. (Amended) A method of stimulating bone progenitor cells located within a bone progenitor 
tissue site of an animal, comprising contacting said tissue site with a composition comprising two 
or more osteotropic genes and a structural bone-compatible matrix so as to promote expression of 
said genes by said cells. 

74. (Amended) The method of claim 73, comprising contacting said tissue site with a 
composition comprising two osteotropic genes and a structural bone-compatible matrix. 

75. (Amended) The method of claim 73, comprising contacting said tissue site with a 
composition comprising three osteotropic genes and a structural bone-compatible matrix. 

76. (Amended) The method of claim 73, wherein expression of said osteotropic genes by said 
cells stimulates said cells to promote bone tissue growth. 

77. (Amended) The method of claim 73, wherein the contacting process comprises bringing 
said osteotropic genes into contact with said structural bone-compatible matrix to form a matrix- 
genes composition and bringing said matrix-genes composition into contact with said tissue site. 

78. (Amended) The method of claim 73, wherein said osteotropic genes are absorbed in said 
structural bone-compatible matrix. 

79. (Amended) The method of claim 73, wherein said osteotropic genes are adsorbed in said 
structural bone-compatible matrix. 



80. (Amended) The method of claim 73, wherein said osteotropic genes are impregnated within 
said structural bone-compatible matrix. 



81. The method of claim 73, wherein said bone progenitor cells are stem cells, macrophages, 
fibroblasts, vascular cells, osteoblasts, chondroblasts or osteoclasts. 



82. The method of claim 8 1 , wherein said bone progenitor cells are fibroblasts. 



83. The method of claim 73 , wherein at least one of said two or more osteotropic genes is in the 
form of plasmid DNA, a DNA insert within the genome of a recombinant adenovirus, a DNA insert 
within the genome of a recombinant adeno-associated virus (AAV) or a DNA insert within the 
genome of a recombinant retrovirus. 

84. The method of claim 73, wherein at least one of said two osteotropic genes is a parathyroid 
hormone (PTH) gene, a bone morphogenetic protein (BMP) gene, a growth factor gene, a growth 
factor receptor gene, a cytokine gene or a chemotactic factor gene. 

85. (Amended) The method of claim 84, wherein at least one of said two osteotropic genes is a 
transforming growth factor (TGF) gene, a fibroblast growth factor (FGF) gene, a 
granulocyte/macrophage colony stimulating factor (GMCSF) gene, an epidermal growth factor 
(EGF) gene, a platelet derived growth factor (PDGF) gene, an insulin-like growth factor (IGF) 
gene, a latent TGF-J5 binding protein (LTBP) gene or a leukemia inhibitory factor (LIF) gene. 



86. (Amended) The method of claim 85, wherein at least one of said two or more osteotropic 
genes is a TGF-oc, TGF-p 1 or TGF-P2 gene. 

87. The method of claim 84, wherein at least one of said two osteotropic genes is a PTH gene. 



88. The method of claim 84, wherein at least one of said two osteotropic genes is a BMP-2A, 
BMP-2B, BMP-3, BMP-4, BMP-5, BMP-6, BMP-7 or BMP-8 gene. 

89. The method of claim 84, wherein at least one of said two osteotropic genes is a PTH1-34 
gene, a BMP-2 gene or a BMP-4 gene. 

90. The method of claim 84, wherein said composition comprises a PTH gene and a BMP gene. 



91. The method of claim 90, wherein said composition comprises a PTH 1-34 gene and a 
BMP-4 gene. 



92. (Amended) The method of claim 73 , wherein said structural bone-compatible matrix is a 
collagenous, metal, hydroxylapatite, bioglass, aluminate, bioceramic, acrylic ester polymer, lactic 
acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

93. (Amended) The method of claim 92, wherein said structural bone-compatible matrix is a 
titanium matrix. 

94. (Amended) The method of claim 93, wherein said structural bone-compatible matrix is a 
titanium matrix coated with hydroxylapatite. 

95. (Amended) The method of claim 92, wherein said structural bone-compatible matrix is a 
collagen preparation. 

96. (Amended) The method of claim 95, wherein said structural bone-compatible matrix is a 
type II collagen preparation. 

97. (Amended) The method of claim 96, wherein said structural bone-compatible matrix is a 
recombinant type II collagen preparation. 

98. (Amended) The method of claim 96, wherein said structural bone-compatible matrix is a 
mineralized type II collagen preparation. 

99. (Amended) The method of claim 92, wherein said structural bone-compatible matrix is a 
lactic acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

100. (Amended) The method of claim 73, wherein said composition is applied to a bone fracture 
site in said animal. 

101. (Amended) The method of claim 73, wherein said composition is implanted within a bone 
cavity site in said animal. 

102. The method of claim 101, wherein said bone cavity site is the result of dental or periodontal 
surgery or the removal of an osteosarcoma. 



103. (Amended) A method of delivering two or more nucleic acid segments to a fibroblast cell 
located within a repair tissue site of an animal, comprising contacting said tissue site with a 
composition comprising two or more nucleic acid segments and a structural bone-compatible 
matrix to effect uptake of the nucleic acid segments into the fibroblast cell and to promote 
expression of transcriptional or translational products by said fibroblast cell. 

104. (Amended) A method of delivering two or more selected nucleic acid segments to a 
fibroblast cell located within a repair tissue site of an animal, comprising the steps of: 

(a) preparing a matrix-nucleic acid composition comprising two or more nucleic acid 
segments and a structural bone-compatible matrix; and 

(b) contacting said repair tissue site with the structural matrix-nucleic acid composition 
to effect uptake of the nucleic acid segments by the fibroblast cell, wherein said 
fibroblast cell expresses transcriptional or translational products encoded by said 
nucleic acid segments. 

105. (Amended) The method of claim 104, wherein step (a) comprises preparing a matrix- 
nucleic acid composition comprising two nucleic acid segments and a structural bone-compatible 
matrix. 

106. (Amended) The method of claim 104, wherein step (a) comprises preparing a matrix- 
nucleic acid composition comprising three nucleic acid segments and a structural bone-compatible 
matrix. 

108. (Amended) The method of claim 104, wherein said nucleic acid segments are absorbed in or 
adsorbed to said structural bone-compatible matrix. 

109. (Amended) The method of claim 104, wherein said nucleic acid segments are impregnated 
within said structural bone-compatible matrix. 

1 10. The method of claim 104, wherein at least one of said two or more nucleic acid segments is 
a DNA molecule. 

111. The method of claim 104, wherein at least one of said two or more nucleic acid segments is 
an RNA molecule. 



1 12. (Amended) The method of claim 104, wherein at least one of said two or more nucleic acid 
segments is an antisense nucleic acid molecule. 

113. The method of claim 1 04, wherein at least one of said two or more nucleic acid segments is 
a linear nucleic acid molecule, a plasmid or a recombinant insert within the genome of a 
recombinant virus. 

1 14. The method of claim 104, wherein at least one of said two or more nucleic acid segments is 
an osteotropic gene. 

115. The method of claim 114, wherein at least one said osteotropic genes is a parathyroid 
hormone (PTH) gene, a bone morphogenetic protein (BMP) gene, a growth factor gene, a growth 
factor receptor gene, a cytokine gene or a chemotactic factor gene. 

116. (Amended) The method of claim 115, wherein at least one of said osteotropic genes is a 
transforming growth factor (TGF) gene, a fibroblast growth factor (FGF) gene, a 
granulocyte/macrophage colony stimulating factor (GMCSF) gene, an epidermal growth factor 
(EGF) gene, a platelet derived growth factor (PDGF) gene, an insulin-like growth factor (IGF) 
gene, a latent TGF-p binding protein (LTBP) gene or a leukemia inhibitory factor (LIF) gene. 

117. The method of claim 116, wherein at least one of said osteotropic genes is a TGF-ot, 
TGF-plorTGF-(32 gene. 

1 1 8. The method of claim 1 15, wherein at least one of said osteotropic genes is a PTH gene. 

1 19. The method of claim 118, wherein at least one of said osteotropic genes is a PTH1-34 gene. 

120. The method of claim 115, wherein at least one of said osteotropic genes is a BMP-2A, 
BMP-2B, BMP-3, BMP-4, BMP-5, BMP-6, BMP-7 or BMP-8 gene. 

121. The method of claim 120, wherein at least one of said osteotropic genes is a BMP-2 or 
BMP-4 gene. 

122. The method of claim 115, wherein said matrix-nucleic acid composition comprises a PTH 
gene and a BMP gene. 



123. The method of claim 122, wherein said matrix-nucleic acid composition comprises a 
PTH1-34 gene and a BMP-4 gene. 



124. (Amended) The method of claim 104, wherein said structural bone-compatible matrix is a 
collagenous, metal, hydroxylapatite, hydroxylapatite-coated metal, bioglass, aluminate, bioceramic, 
acrylic ester polymer, lactic acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer 
matrix. 



125. (Amended) The method of claim 124, wherein said structural bone-compatible matrix is a 
titanium matrix. 



126. (Amended) The method of claim 125, wherein said structural bone-compatible matrix is a 
titanium matrix coated with hydroxylapatite. 



127. (Amended) The method of claim 124, wherein said structural bone-compatible matrix is a 
collagen preparation. 



128. (Amended) The method of claim 127, wherein said structural bone-compatible matrix is a 
type II collagen preparation. 



129. (Amended) The method of claim 128, wherein said structural bone-compatible matrix is a 
recombinant type II collagen preparation. 



130. (Amended) The method of claim 128, wherein said structural bone-compatible matrix is a 
type II collagen preparation further supplemented with minerals. 

131. (Amended) The method of claim 124, wherein said structural bone-compatible matrix is a 
lactic acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

132. (Amended) A composition comprising two or more nucleic acid segments in association 
with a structural bone-compatible matrix. 



133. (Amended) The composition of claim 132, wherein said composition comprises two nucleic 
acid segments in association with said structural bone-compatible matrix. 

134. (Amended) The composition of claim 132, wherein said composition comprises three 
nucleic acid segments in association with said structural bone-compatible matrix. 

135. (Amended) The composition of claim 132, wherein said nucleic acid segments are absorbed 
in or adsorbed to said structural bone-compatible matrix. 

136. (Amended) The composition of claim 132, wherein said nucleic acid segments are 
impregnated within said structural bone-compatible matrix. 

137. The composition of claim 132, wherein at least one of said two or more nucleic acid 
segments is a DNA molecule. 

138. The composition of claim 132, wherein at least one of said two or more nucleic acid 
segments is an RNA molecule. 

139. The composition of claim 132, wherein at least one of said two or more nucleic acid 
segments is an antisense nucleic acid molecule. 

140. The composition of claim 132, wherein at least one of said two or more nucleic acid 
segments is a linear nucleic acid molecule, a plasmid or a recombinant insert within the genome of 
a recombinant virus. 



14L (Amended) The composition of claim 1 32, wherein at least one of said two or more nucleic 
acid segments encodes a polypeptide or protein that stimulates bone progenitor cells when 
expressed by said cells. 

142. (Amended) The composition of claim 132, wherein said structural bone-compatible matrix 
is a collagenous, titanium, hydroxylapatite, hydroxylapatite-coated titanium, bioglass, aluminate, 
bioceramic, acrylic ester polymer, lactic acid polymer, glycolic acid polymer or lactic acid/glycolic 
acid polymer matrix. 



143. (Amended) The composition of claim 142, wherein said structural bone-compatible matrix 
is a collagen preparation. 

144. (Amended) The composition of claim 143, wherein said structural bone-compatible matrix 
is a type II collagen preparation. 

145. (Amended) The composition of claim 144, wherein said structural bone-compatible matrix 
is a recombinant type II collagen preparation. 

146. (Amended) The composition of claim 144, wherein said structural bone-compatible matrix 
is a type II collagen preparation further supplemented with minerals. 

147. (Amended) The composition of claim 146, wherein said structural bone-compatible matrix 
is a type II collagen preparation further supplemented with calcium. 



148. (Amended) The composition of claim 142, wherein said structural bone-compatible matrix 
is a lactic acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

149. (Amended) A composition comprising two or more osteotropic genes in association with a 
structural bone-compatible matrix, said composition being capable of stimulating bone growth 
when administered to a bone progenitor tissue site of an animal. 

150. (Amended) The composition of claim 149, wherein said composition comprises two 
osteotropic genes in association with said structural bone-compatible matrix. 

151. (Amended) The composition of claim 149, wherein said composition comprises three 
osteotropic genes in association with said structural bone-compatible matrix. 

152. (Amended) The composition of claim 149, wherein said osteotropic genes are absorbed in 
or adsorbed to said structural bone-compatible matrix. 



153. (Amended) The composition of claim 149, wherein said osteotropic genes are impregnated 
within said structural bone-compatible matrix. 



154. The composition of claim 149, wherein at least one of said two or more osteotropic genes is 
in the form of plasmid DNA, a DNA insert within the genome of a recombinant adenovirus, a DNA 
insert within the genome of a recombinant adeno-associated virus (AAV) or a DNA insert within 
the genome of a recombinant retrovirus. 

155. (Amended) The composition of claim 149, wherein at least one of said two or more 
osteotropic genes is a PTH, BMP-2A, BMP-2B, BMP-3, BMP-4, BMP-5, BMP-6, BMP-7, BMP-8, 
TGF-oc, TGF-pl, TGF-p2, FGF, GMCSF, EGF, PDGF, IGF, LTBP or a LIF gene. 



1 56. (Amended) The composition of claim 1 55, wherein at least one of said osteotropic genes is 
a TGF-a, TGF-pi, TGF-p2, PTH, LTBP, BMP-2 or BMP-4 gene. 

1 57. The composition of claim 1 55, wherein at least one of said osteotropic genes is a PTH gene. 

158. The composition of claim 157, wherein at least one of said osteotropic genes is a PTH1-34 
gene. 

159. The composition of claim 155, wherein at least one of said osteotropic genes is a BMP-2 or 
BMP-4 gene. 

1 60. The composition of claim 1 55, wherein said composition comprises a PTH gene and a BMP 
gene. 

161. The composition of claim 160, wherein said composition comprises a PTH1-34 gene and a 
BMP-4 gene. 

162. (Amended) The composition of claim 149, wherein said structural bone-compatible matrix 
is a collagenous, metal, hydroxyapatite, bioglass, aluminate, bioceramic, acrylic ester polymer, 
lactic acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

163. (Amended) The composition of claim 162, wherein said structural bone-compatible matrix 
is a titanium matrix. 



164. (Amended) The composition of claim 163, wherein said structural bone-compatible matrix 
is a titanium matrix coated with hydroxylapatite. 



165. (Amended) The composition of claim 162, wherein said structural bone-compatible matrix 
is a collagen preparation. 

166. (Amended) The composition of claim 165, wherein said structural bone-compatible matrix 
is a type II collagen preparation. 

167. (Amended) The composition of claim 166, wherein said structural bone-compatible matrix 
is a recombinant type II collagen preparation. 

168. (Amended) The composition of claim 166, wherein said structural bone-compatible matrix 
is a type II collagen preparation further supplemented with minerals. 

169. (Amended) The composition of claim 168, wherein said structural bone-compatible matrix 
is a type II collagen preparation further supplemented with calcium. 

170. (Amended) The composition of claim 162, wherein said structural bone-compatible matrix 
is a lactic acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

171. (Amended) An osteotropic device, comprising two or more osteotropic genes capable of 
expression by bone progenitor cells, the genes associated with an amount of a structural bone- 
compatible matrix effective to absorb or adsorb said genes, wherein said device is capable of 
stimulating bone formation when implanted within a bone progenitor tissue site of an animal. 

172. The device of claim 171 , wherein said device comprises three or more osteotropic genes. 

173. The device of claim 171, wherein said device is a titanium or a hydroxylapatite-coated 
titanium device. 

174. The device of claim 171, wherein said device is shaped to join a bone fracture site in said 
animal. 

175. The device of claim 171, wherein said device is shaped to fill a bone cavity site in said 
animal. 



176. The device of claim 171, wherein said device is an artificial joint. 

177. The method of claim 51, wherein at least one of said two or more nucleic acid segments is 
an RNA molecule. 



178. The method of claim 85, wherein at least one of said two or more osteotropic genes is an 
LTBP gene. 



179. The method of claim 1 1 6, wherein at least one of said osteotropic genes is an LTBP gene. 



180. The device of claim 171, wherein said device comprises a PTH, BMP-2A, BMP-2B, BMP- 
3, BMP-4, BMP-5, BMP-6, BMP-7, BMP-8, TGF-a, TGF-pl, TGF-p2, FGF, GMCSF, EGF, 
PDGF, IGF, LTBP or a LIF gene. 



181. The device of claim 180, wherein said device comprises a TGF-a, TGF-pi, TGF-p2, 
PTH1-34, LTBP, BMP-2 or BMP-4 gene. 



1 82. The device of claim 1 80, wherein said device comprises a PTH gene and a BMP gene. 



EXHIBIT E 

Issued Claims in Great Grandparent, Patent No. 5,763,416 



1. A method for transferring a nucleic acid segment into bone progenitor cells located 
within a bone progenitor tissue site of an animal, comprising contacting said tissue site with a 
composition comprising a nucleic acid segment and a structural bone-compatible matrix so as to 
transfer said nucleic acid segment into said cells, wherein said nucleic acid segment expresses a 
transcriptional or translational product in said cells. 



2. The method of claim 1, wherein the contacting process comprises bringing said nucleic 
acid segment into contact with said structural bone-compatible matrix to form a matrix-nucleic 
acid segment composition and bringing said matrix-nucleic acid segment composition into 
contact with said tissue site. 



3. The method of claim 2, wherein said nucleic acid segment is absorbed in said structural 
bone-compatible matrix. 



4. The method of claim 2, wherein said nucleic acid segment is adsorbed to said structural 
bone-compatible matrix. 



5. The method of claim 2, wherein said nucleic acid segment is impregnated within said 
structural bone-compatible matrix.. 



6. The method of claim 1 , wherein said nucleic acid segment is a DNA molecule. 



7. The method of claim 1, wherein said nucleic acid segment is an RNA molecule. 



8. The method of claim 1, wherein said nucleic acid segment is an antisense nucleic acid 
molecule. 



9. The method of claim 1, wherein said nucleic acid segment is a linear nucleic acid 
molecule, a plasmid or a recombinant insert within the genome of a recombinant virus. 



10. The method of claim 1, wherein said nucleic acid segment encodes a polypeptide or 
protein that stimulates bone progenitor cells when expressed in said cells. 



1 L The method of claim 1, wherein said structural bone-compatible matrix is a collagenous, 
metal, hydroxylapatite, bioglass, aluminate, bioceramic, acrylic ester polymer, lactic acid 
polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

12. The method of claim 11, wherein said structural bone-compatible matrix is a titanium 
matrix. 



13. The method of claim 12, wherein said structural bone-compatible matrix is a titanium 
matrix coated with hydroxylapatite. 

14. The method of claim 1 1 , wherein said bone-compatible matrix is a collagen preparation. 

15. The method of claim 11, wherein said bone-compatible matrix is a lactic acid polymer, 
glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

16. The method of claim 1, wherein said bone progenitor cells are stem cells, macrophages, 
fibroblasts, vascular cells, osteoblasts, chondroblasts or osteoclasts. 

17. The method of claim 16, wherein said bone progenitor cells are fibroblasts. 

18. A method of stimulating bone progenitor cells located within a bone progenitor tissue site 
of an animal, comprising contacting said tissue site with a composition comprising an osteotropic 
gene and a structural bone-compatible matrix so as to promote expression of said gene in said 
cells. 



19. The method of claim 18, wherein expression of said gene in said cells stimulates said 
cells to promote bone tissue growth. 

20. The method of claim 18, wherein the contacting process comprises bringing said 
osteotropic gene into contact with said structural bone-compatible matrix to form a matrix-gene 
composition and bringing said matrix-gene composition into contact with said tissue site. 

21. The method of claim 20, wherein said nucleic acid segment is absorbed in said structural 
bone-compatible matrix. 



22. The method of claim 20, wherein said nucleic acid segment is adsorbed to said structural 
bone-compatible matrix. 

23. The method of claim 20, wherein said nucleic acid segment is impregnated within said 
structural bone-compatible matrix. 

24. The method of claim 18, wherein said osteotropic gene is in the form of plasmid DNA, a 
DNA insert within the genome of a recombinant adenovirus, a DNA insert within the genome of 
a recombinant adeno-associated virus (AAV) or a DNA insert within the genome of a 
recombinant retrovirus. 



25. The method of claim 18, wherein said osteotropic gene is a parathyroid hormone (PTH) 
gene, a bone morphogenetic protein (BMP) gene, a growth factor gene, a growth factor receptor 
gene, a cytokine gene or a chemotactic factor gene. 

26. The method of claim 25, wherein said osteotropic gene is a transforming growth factor 
(TGF) gene, a fibroblast growth factor (FGF) gene, a granulocyte/macrophage colony 
stimulating factor (GMCSF) gene, an epidermal growth factor (EGF) gene, a platelet derived 
growth factor (PDGF) gene, an insulin-like growth factor (IGF) gene, or a leukemia inhibitory 
factor (LIF) gene. 



27. The method of claim 26, wherein said osteotropic gene is a TGF-oc, TGF-J31 or TGF-(32 
gene. 

28. The method of claim 25, wherein said osteotropic gene is a PTH gene. 

29. The method of claim 28, wherein said osteotropic gene is a PTH1 -34. 



30. The method of claim 25, wherein said osteotropic gene is a BMP-2A, BMP-2B, BMP-3, 
BMP-4, BMP-5, BMP-6, BMP-7 or BMP-8 gene. 



3 1 . The method of claim 30, wherein said osteotropic gene is a BMP-2 or BMP-4 gene. 



32. The method of claim 1 8, wherein said structural bone-compatible matrix is a collagenous, 
metal, hydroxylapatite, bioglass, aluminate, bioceramic, acrylic ester polymer, lactic acid 
polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

33. The method of claim 32, wherein said structural bone-compatible matrix is a titanium 
matrix. 

34. The method of claim 33, wherein said structural bone-compatible matrix is a titanium 
matrix coated with hydroxylapatite. 

35. The method of claim 32, wherein said structural bone-compatible matrix is a collagen 
preparation. 

36. The method of claim 32, wherein said structural bone-compatible matrix is a lactic acid 
polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

37. The method of claim 19, wherein said matrix-gene composition is applied to a bone 
fracture site in said animal. 

38. The method of claim 19, wherein said matrix-gene composition is implanted within a 
bone cavity site in said animal. 

39. The method of claim 38, wherein said bone cavity site is the result of dental or 
periodontal surgery or the removal of an osteosarcoma. 

40. The method of claim 18, wherein said bone progenitor cells are stem cells, macrophages, 
fibroblasts, vascular cells, osteoblasts, chondroblasts or osteoclasts. 

41 . The method of claim 40, wherein said bone progenitor cells are fibroblasts. 

42. A composition comprising a nucleic acid segment in association with a structural bone- 
compatible matrix. 

43. The composition of claim 42, wherein said nucleic acid segment is absorbed in said 
structural bone-compatible matrix. 



44. The composition of claim 42, wherein said nucleic acid segment is adsorbed to said 
structural bone-compatible matrix. 

45. The composition of claim 42, wherein said nucleic acid segment is impregnated within 
said structural bone-compatible matrix. 

46. The composition of claim 42, wherein said nucleic acid segment is a DNA molecule. 

47. The composition of claim 42, wherein said nucleic acid segment is an RNA molecule. 

48. The composition of claim 42, wherein said nucleic acid segment is an antisense nucleic 
acid molecule. 

49. The composition of claim 42, wherein said nucleic acid segment is a linear nucleic acid 
molecule, a plasmid or a recombinant insert within the genome of a recombinant virus. 

50. The composition of claim 42, wherein said nucleic acid segment encodes a polypeptide or 
protein that stimulates bone progenitor cells when expressed in said cells. 

51. The composition of claim 42, wherein said structural bone-compatible matrix is a 
collagenous, titanium, hydroxylapatite, hydroxylapatite-coated titanium, bioglass, aluminate, 
bioceramic, acrylic ester polymer, lactic acid polymer, glycolic acid polymer or lactic 
acid/glycolic acid polymer matrix. 

52. The composition of claim 51, wherein said structural bone-compatible matrix is a 
collagen preparation. 

53. The composition of claim 51, wherein said bone-compatible matrix is a lactic acid 
polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

54. A composition comprising an osteotropic gene in association with a bone-compatible 
matrix, said composition being capable of stimulating bone growth when administered to a bone 
progenitor tissue site of an animal. 



55. The composition of claim 54, wherein said osteotropic gene is absorbed in said structural 
bone-compatible matrix. 



56. The composition of claim 54, wherein said osteotropic gene is adsorbed to said structural 
bone-compatible matrix. 

57. The composition of claim 54, wherein said osteotropic gene is impregnated within said 
structural bone-compatible matrix. 

58. The composition of claim 54, wherein said osteotropic gene is in the form of plasmid 
DNA, a DNA insert within the genome of a recombinant adenovirus, a DNA insert within the 
genome of a recombinant adeno-associated virus (AAV) or a DNA insert within the genome of a 
recombinant retrovirus. 



59. The composition of claim 54, wherein said osteotropic gene is a PTH, BMP-2A, BMP- 
2B, BMP-3, BMP-4, BMP-5, BMP-6, BMP-7, BMP-8, TGF-a, TGF-pl, TGF-p2, FGF, 
GMCSF, EGF, PDGF, IGF or a LIF gene. 



60. The composition claim 59, wherein said osteotropic gene is a TGF-a, TGF-pi, TGF-f52, 
PTH, BMP-2 or BMP-4 gene. 

61 . The composition of claim 60, wherein said osteotropic gene is a PTH1 -34 gene. 

62. The composition of claim 54, wherein said structural bone-compatible matrix is a 
collagenous, metal, hydroxyapatite, bioglass, aluminate, bioceramic, acrylic ester polymer, lactic 
acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

63. The composition of claim 62, wherein said structural bone-compatible matrix is a 
titanium matrix. 

64. The composition of claim 63, wherein said structural bone-compatible matrix is a 
titanium matrix coated with hydroxylapatite. 



65. The composition of claim 62, wherein said structural bone-compatible matrix is a 
collagen preparation. 



66. The composition of claim 62, wherein said structural bone-compatible matrix is a lactic 
acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 

67. An osteotropic device, comprising an osteotropic gene capable of expression in bone 
progenitor cells, the gene associated with an amount of a structural bone-compatible matrix 
effective to absorb said gene, wherein said device is capable of stimulating bone formation when 
implanted within a bone progenitor tissue site of an animal. 

68. The device of claim 67, wherein said device is a titanium or a hydroxylapatite-coated 
titanium device. 

69. The device of claim 67, wherein said device is shaped to join a bone fracture site in said 
animal. 

70. The device of claim 67, wherein said device is shaped to fill a bone cavity site in said 
animal. 

71 . The device of claim 67, wherein said device is an artificial joint. 

72. A method of delivering a nucleic acid segment to a fibroblast cell located within a repair 
tissue site of an animal, comprising contacting said tissue site with a composition comprising a 
nucleic acid segment and a structural bone-compatible matrix to effect uptake of the nucleic acid 
segment into the fibroblast cell and to promote expression of a transcriptional or translational 
product in said cell. 

73. A method of delivering at least one selected nucleic acid segment to a fibroblast cell 
located within a repair tissue site of an animal, comprising the steps of: 

(a) preparing a matrix-nucleic acid composition comprising at least one nucleic acid 
segment and a structural bone-compatible matrix; and 

(b) contacting said repair tissue site with the structural matrix-nucleic acid 
composition to effect uptake of the nucleic acid segment by the fibroblast cell, 
wherein said nucleic acid segment expresses a transcriptional or translational 
product in said cell. 



74. The method of claim 73, wherein said nucleic acid segment is a linear nucleic acid 
molecule, a plasmid or a recombinant insert within the genome of a recombinant virus. 

75. The method of claim 73, wherein said structural bone-compatible matrix is a collagenous, 
metal, hydroxylapatite, hydroxylapatite-coated metal, bioglass, aluminate, bioceramic, acrylic 
ester polymer, lactic acid polymer, glycolic acid polymer or lactic acid/glycolic acid polymer 
matrix. 

76. The method of claim 75, wherein said structural bone-compatible matrix is a collagenous 
matrix. 

77. The method of claim 75, wherein said structural bone-compatible matrix is a lactic acid 
polymer, glycolic acid polymer or lactic acid/glycolic acid polymer matrix. 



EXHIBIT F 

Original Claims in Grandparent, Patent No. 5,763,416 



1. A method for transferring a nucleic acid segment into bone progenitor cells, comprising 
contacting bone progenitor cells with a composition comprising a nucleic acid segment so as to 
transfer said nucleic acid segment into said cells. 

2. The method of claim 1, wherein said cells are located within a bone progenitor tissue site 
of an animal and said tissue site is contacted with said composition so as to promote nucleic acid 
transfer into bone progenitor cells in situ. 

3. The method of claim 2, wherein the contacting process comprises bringing said nucleic 
acid segment into contact with a bone-compatible matrix to form a matrix-nucleic acid segment 
composition and bringing said matrix-nucleic acid segment composition into contact with said 
tissue site. 

4. The method of claim 1, wherein said nucleic acid segment is a DNA molecule. 

5. The method of claim 1, wherein said nucleic acid segment is an RNA molecule. 

6. The method of claim 1, wherein said nucleic acid segment is an antisense nucleic acid 
molecule. 

7. The method of claim 1, wherein said nucleic acid segment is a linear nucleic acid 
molecule, a plasmid or a recombinant insert within the genome of a recombinant virus. 

8. The method of claim 1, wherein said nucleic acid segment encodes a polypeptide or 
protein that stimulates bone progenitor cells when expressed in said cells. 

9. The method of claim 3, wherein said bone-compatible matrix is a collagenous, metal, 
hydroxylapatite, bioglass, aluminate, bioceramic, acrylic ester polymer or lactic acid polymer 
matrix. 

10. The method of claim 9, wherein said bone-compatible matrix is a titanium matrix. 



1 1 . The method of claim 1 0, wherein said bone-compatible matrix is a titanium matrix coated 
with hydroxy lapatite. 

12. The method of claim 9, wherein said bone-compatible matrix is a collagen preparation. 

13. A method of stimulating bone progenitor cells, comprising contacting bone progenitor 
cells with a composition comprising an osteotropic gene so as to promote expression of said gene 
in said cells. 



14. The method of claim 13, wherein said cells are located within a bone progenitor tissue 
site of an animal and said tissue site is contacted with said composition so as to promote bone 
tissue growth. 

15. The method of claim 14, wherein the contacting process comprises bringing said 
osteotropic gene into contact with a bone-compatible matrix to form a matrix-gene composition 
and bringing said matrix -gene composition into contact with said tissue site. 

16. The method of claim 13, wherein said osteotropic gene is in the form of plasmid DNA, a 
DNA insert within the genome of a recombinant adenovirus, a DNA insert within the genome of 
a recombinant adeno-associated virus (AAV) or a DNA insert within the genome of a 
recombinant retrovirus. 



17. The method of claim 13, wherein said osteotropic gene is a parathyroid hormone (PTH) 
gene, a bone morphogenetic protein (BMP) gene, a growth factor gene, a growth factor receptor 
gene, a cytokine gene or a chemotactic factor gene. 

18. The method of claim 17, wherein said osteotropic gene is a transforming growth factor 
(TGF) gene, a fibroblast growth factor (FGF) gene, a granulocyte/macrophage colony 
stimulating factor (GMCSF) gene, an epidermal growth factor (EGF) gene, a platelet derived 
growth factor (PDGF) gene, an insulin-like growth factor (IGF) gene, or a leukemia inhibitory 
factor (LIF) gene. 



19. The method of claim 18, wherein said osteotropic gene is a TGF-a, TGF-pl or TGF-P2 
gene. 



20. 



The method of claim 17, wherein said osteotropic gene is a PTH gene. 



21. The method of claim 17, wherein said osteotropic gene is a BMP-2A, BMP-2B, BMP-3, 
BMP-4, BMP-5, BMP-6, BMP-7 or BMP-8 gene. 

22. The method of claim 2 1 , wherein said osteotropic gene is a BMP-2 or BMP-4 gene. 

23. The method of claim 15, wherein said bone-compatible matrix is a collagenous, metal, 
hydroxylapatite, bioglass, aluminate, bioceramic, acrylic ester polymer or lactic acid polymer 
matrix. 

24. The method of claim 23 , wherein said bone-compatible matrix is a titanium matrix. 

25 . The method of claim 24, wherein said bone-compatible matrix is a titanium matrix coated 
with hydroxylapatite. 

26. The method of claim 23, wherein said bone-compatible matrix is a collagen preparation. 

27. The method of claim 15, wherein said matrix-gene composition is applied to a bone 
fracture site in said animal. 

28. The method of claim 15, wherein said matrix-gene composition is implanted within a 
bone cavity site in said animal. 

29. The method of claim 15, wherein said bone cavity site is the result of dental or 
periodontal surgery or the removal of an osteosarcoma. 

30. A composition comprising a nucleic acid segment in association with a bone-compatible 
matrix. 

31. The composition of claim 30, wherein said nucleic acid segment is a DNA molecule. 

32. The composition of claim 30, wherein said nucleic acid segment is an RNA molecule. 



33. The composition of claim 30, wherein said nucleic acid segment is an antisense nucleic 
acid molecule. 

34. The composition of claim 30, wherein said nucleic acid segment is a linear nucleic acid 
molecule, a plasmid or a recombinant insert within the genome of a recombinant virus. 

35. The composition of claim 30, wherein said nucleic acid segment encodes a polypeptide or 
protein that stimulates bone progenitor cells when expressed in said cells. 

36. The composition of claim 30, wherein said bone-compatible matrix is a collagenous, 
titanium, hydroxylapatite, hydroxylapatite-coated titanium, bioglass, aluminate, bioceramic, 
acrylic ester polymer or lactic acid polymer matrix. 

37. The composition of claim 36, wherein said bone-compatible matrix is a collagen 
preparation. 

38. A composition comprising an osteotropic gene in association with a bone-compatible 
matrix, said composition being capable of stimulating bone growth when administered to a bone 
progenitor tissue site of an animal. 

39. The composition of claim 38, wherein said osteotropic gene is in the form of plasmid 
DNA, a DNA insert within the genome of a recombinant adenovirus, a DNA insert within the 
genome of a recombinant adeno-associated virus (AAV) or a DNA insert within the genome of a 
recombinant retrovirus. 



40. The composition of claim 38, wherein said osteotropic gene is a PTH, BMP-2A, 
BMP-2B, BMP-3, BMP-4, BMP-5, BMP-6, BMP-7, BMP-8, TGF-cc, TGF-pi, TGF-(52, FG, 
GMCSF, EGF, PDGF, IGF or a LIF gene. 

41. The composition claim 40, wherein said osteotropic gene is a TGF-a, TGF-pi, TGF-p2, 
PTH, BMP-2 or BMP-4 gene. 

42. The composition of claim 38, wherein said bone-compatible matrix is a collagenous, 
metal, hydroxyapatite, bioglass, aluminate, bioceramic, acrylic ester polymer or lactic acid 
polymer matrix. 



43. The composition of claim 42, wherein said bone-compatible matrix is a titanium matrix. 



44. The composition of claim 43, wherein said bone-compatible matrix is a titanium matrix 
coated with hydroxylapatite. 



45. The composition of claim 42, wherein said bone-compatible matrix is a collagen 
preparation. 



46. An osteotropic device, comprising an osteotropic gene capable of expression in bone 
progenitor cells, the gene associated with an amount of a bone-compatible matrix effective to 
absorb said gene, wherein said device is capable of stimulating bone formation when implanted 
within a bone progenitor tissue site of an animal. 



47. The device of claim 46, wherein said device is a titanium or a hydroxylapatite-coated 
titanium device. 



48. The device of claim 46, wherein said device is shaped to join a bone fracture site in said 
animal. 



49. The device of claim 46, wherein said device is shaped to fill a bone cavity site in said 
animal 



50. 



The device of claim 46, wherein said device is an artificial joint. 
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BACKGROUND OF THE INVENTION 



The present application is a continuation-in-part of PCT/US95/02251, filed February 
21, 1995; which is a continuation-in-part of U.S. Serial Number 08/316,650, filed September 
30, 1994; which is a continuation-in-part of U.S. Serial Number 08/199,780, filed February 
18, 1994; the entire text and figures of which disclosures are specifically incorporated herein 
by reference without disclaimer. The United States government has certain rights in the 
present invention pursuant to Grant HL-41926 from the National Institutes of Health. 

A. Field of the Invention 

The present invention relates generally to the field of molecular biology. More 
particularly it relates to latent TGF/3 binding protein (LTBP) genes, compositions and 
methods of use. 

B. Description of the Related Art 

1. TGF-jS 

Five TGF-jS family members, which share 66-82% sequence identity, have been 
identified (Kingsley, 1994). Whereas TGF-j31 was cloned from a cDNA library derived 
from human placenta, TGF-/32 was subsequently purified from several mammalian cells and 
tissues, and TGF-j33, -/34, and -/35 were cloned by low stringency hybridization from 
mammalian, avian and amphibian cDNA libraries, respectively. Peptide growth 
factors/cytokines have also been identified that share sequence homology ( <40%) with the 
TGF-/8s (collectively, the TGF-/?s plus these other cytokines make up the TGF-jS 
superfamily). A unifying feature of the biology of these other cytokines (i.e. , the Mullerian 
inhibiting substance, bone morphogenetic proteins, growth and differentiation factors, 
activin/inhibin, Drosophila decapentaplegic complex, and amphibian Vgl protein) is the 
ability to regulate developmental processes. In every case where information is available, 



superfamily members are synthesized as larger precursors that are processed at 
endoproteolytic cleavage motifs, and they terminate with the sequence C-X-C-X. The three 
dimensional crystal structure of the TGF-02 homodimer was recently reported (McDonald 
and Hendrickson, 1993). This work has led to the interesting and novel suggestion that 
5 TGF-/J is related to certain peptide growth factors (e.g. , NGF, PDGF, v-SIS) in a way that 
could not have been predicted from the deduced amino acid analysis. 

2. Latent TGF-jS Complexes 

10 Many cell types produce TGF-j3, and almost all cells bind TGF-/3 with affinities in the 

picomolar range — e.g. , the type I and type II TGF-/3 cell surface receptors (glycoproteins of 
53 and 75 kDa, respectively) are present in essentially all cells (Miyazono et al, 1994). 
o Thus, TGF-/3 has powerful effects on most cell types, and cytokines such as TGF-jS are 
^ thought to exert broad control the tissue remodeling that occurs during development, wound 
|5 repair, and other situations (Sporn et al , 1986; Moses et al. , 1990). (For a comprehensive 
31 review of TGF-/3 effects, see Roberts and Sporn, 1990). For example, TGF-jS was initially 
identified as a factor that stimulated the anchorage independent growth of rodent fibroblasts 
^ (Assoian et al, 1983; Frolik et al, 1983; Roberts et al, 1983). It is now known, however, 
Lf; that TGF-/3 acts as a potential growth inhibitor for most cells, i.e., epithelial, endothelial, 
j|p and hematopoietic progenitor cells; both stimulates and inhibits cellular differentiation; 
D induces extracellular matrix production by stimulating the expression of matrix 

macromolecules, stimulating the expression of matrix protease inhibitors, and decreasing the 
expression of matrix degrading proteases; inhibits the functional activities of immune cells; 
induces the chemotaxis of fibroblasts, macrophages, and smooth muscle cells; induces 
25 angiogenesis in vivo; inhibits endothelial migration; induces the expression of cell surface 
receptors for other cytokines (e.g., the EGF receptor); promotes the healing of incisional 
wounds; inhibits osteoblast proliferation in vitro; and induces new bone formation in vivo. 

A molecular explanation for these complex (and, at times, conflicting) effects is not 
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yet available, but hypotheses do exist. Sporn et al (1986) have suggested, for example, that 
the ability of TGF-/3 to stimulate or inhibit the proliferation of mesenchymal cells depends on 
the state of cellular differentiation and the entire set of growth factors operant in that cell 
population. As such, the "biological meaning" of TGF-/J signal transduction depends on the 
context {i.e., availability and presentation) of other growth factors present in the local 
environment: Fischer rat 3T3 cells transfected with a myc gene and incubated with TGF-/3 
and PDGF proliferate in soft agar, whereas the sane cells in the presence of TGF-/3 and 
EGF fail to grow (Roberts et al, 1985). 

Whatever the mechanism, the autocrine and paracrine activities of TGF-j8 clearly must 
be regulated with precision. One regulatory strategy involves the temporal and spatial 
control of TGF-/3 gene expression. A second strategy involves the production and storage of 
TGF-/J as a latent complex that is activated only under certain physiological and pathological 
conditions — e.g., tissue morphogenesis and remodeling, and wound healing. TGF-/31 can 
be isolated from serum and from most tissues as a latent complex (Pircher et al, 1986; 
Miyazono et al , 1988; Wakefield et al , 1988). In this regard, the latent complex has been 
purified from human platelets and characterized in detail (Miyazono et al , 1988). Following 
a 6-step protocol, the purified complex yielded protein bands of Mr 25,000 and 210,000 on 
SDS-PAGE under nonreducing conditions. After reduction, the 25 kDa band was shown to 
consist of subunits of Mr 12,500. On the other hand, the 210 kDa band consisted of a Mr 
40,000 subunit and Mr 125-160,000 subunit. 

TGF-0 is also secreted from several producer cell lines in culture as a latent complex 
of 235 kDa (Gentry et al , 1987). TGF-jSl is initially synthesized in vitro as a 390 amino 
acid precursor that consists of a signal peptide, an amino-terminal propeptide, and the mature 
growth factor. Two precursor chains associate to form a disulfide-bonded dimer with latent 
activity. Homodimers occur most commonly, but heterodimers may also form (Ogawa et 
al , 1992). The full length dimer is cleaved at a endoproteolytic cleavage motif, but the 
propeptide dimer {i.e. , the latency associated peptide or LAP) and the mature growth factor 



dimer typically remain non-covalently associated. The mature TGF-& dimer is now known 
to be the 25 kDa band identified after nonreducing SDS-PAGE of the purified latent complex 
from platelets. In addition, LAP is known to be a component of the 210 kDa band identified 
after nonreducing SDS-PAGE of the purified latent complex from platelets (i.e., LAP has 
5 been shown to consist of two of the 40 kDa subunits). 

Together LAP and the mature TGF-jS dimer form the small latent complex. As 
demonstrated in platelets, small latent complexes may be associated with additional high 
molecular weight proteins, the best characterized of which is the latent TGF-/J binding 
10 protein or LTBP (Kanzaki et al , 1990). (LTBP has been shown to be the 125-160 kDa 

subunit of the purified latent complex from platelets). Latent TGF-/3 complexes that contain 
LTBP are also known as large latent complexes. In contrast to platelet LTBP, the LTBP 
O produced by fibroblasts typically is a 190 kDa polypeptide. The smaller size of platelet 
yi LTBP may be due to proteolytic processing or alternative splicing (Kanzaki et al , 1990; 
J{5 Tsuji etal, 1990). 

Kj 

Ul 3. LAP and Latency ^ 

f'; TGF-j3 latency results in part from the non-covalent association of the propeptide 

(§0 dimer and the mature TGF-/3 dimer (Pircher et al. , 1984; Gentry et al. , 1988; Wakefield et 

v.'"""z 

% al. , 1989). A cDNA for the TGF-j31 precursor was expressed in Chinese Hamster Ovary 
(CHO) cells, which do not express LTBP (Gentry et al., 1988), and almost all TGF-/3 
activity recovered from the medium of transfected cells was latent. Use of deletion 
constructs has demonstrated that synthesis of biologically active TGF-|31 can proceed only 

25 from the first ATG codon, implicating LAP in the proper assembly of the small latent 
complex in these cells. Taken together, these studies indicate that LAP is sufficient to 
achieve the latent state. More recent studies have shown that carbohydrate structures within 
LAP make an important contribution to the latent state. For example, treatment of the latent 
form of TGF-iSl with endoglycosidase F led to activation of TGF-/3 (Miyazono and Heldin, 
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1989). (The structure of the mature TGF-/J dimer was not affected by enzyme treatment). 
In particular, sialic acid residues seemed to be important, as treatment of the purified latent 
complex with sialidase was also able to activate TGF-/3 from the latent state. 

5 4. Modulation of Latency 

Latent complexes must be dissociated to activate mature TGF-/?, aM dissociation is 
considered to be a critical step in governing TGF-0 effects (Twardzik et al , 1990; Sato et 
al , 1993). Dissociation by chemical treatment of the latent complex purified from platelets 
10 has been investigated (Miyazono et al , 1990). Incubation of the purified complex under 

conditions of varying pH revealed that TGF-j8 activity was unmasked at values below pH 3.5 
and above pH 12.5. Incubation of latent TGF-/? in 0.02% SDS or 8 M urea also effectively 
J;] unmasked TGF-j3 activity, but incubation in 5 M NaCl did not. Wakefield et al (Wakefield 
u j et al , 1989) have reported that, after activation, TGF-|81 and LAP reassociate in a time- and 
rj|5 concentration-dependent manner under neutral, nondenaturing conditions. These results are 
consistent with the idea that the mature TGF-/3 dimer is non-covalently associated with LAP. 

□ Latent TGF-/3 complexes are also dissociated by the action of certain enzymes. For 

^ example, latent TGF-/? is activated by plasmin, which disrupts the structure of the large 
mo latent complex (Lyons et al , 1988; Taipale et al , 1995). Similar data exist for other 
q enzymes, e.g., cathepsin D, mast cell chymase, leukocyte elastase, and the glycosidases. 
Recently, osteoclast-derived cells were shown to be capable of activating latent TGF-/3 in 
vitro (Oreffo et al , 1989). Osteoclast activation is of particular interest because of the 
hypothesis that TGF-/3 serves as a link between bone turnover and formation during bone 
25 remodeling (Centrella et al , 1991). The mechanism of TGF-/3 activation by osteoclasts is 

not known at present, but it is reasonable to think that local alteration of pH due to action of 
proton pumps in the osteoclast plasma membrane or the release of osteoclast-derived 
proteases may be involved in the activation process. Related to these observations, activated 
macrophages (as might be found at a wound site or during tissue morphogenesis) secrete 
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sialidase and other proteases (Pilatte et al , 1987), and they can lower the local pH to 4.0 
(Silver et al , 1988), both of which could contribute to TGF-/J activation in vivo. As 
mentioned above, acidification weakens the non-covalent interaction between LAP and the 
mature TGF-/3 dimer (Wakefield et al, 1989). 

5 

SUMMARY OF THE INVENTION 

The present invention concerns in an overall and general sense novel DNA segments 
and recombinant vectors encoding LTBP-2 or LTBP-3, and the creation and use of 
10 recombinant host cells through the application of DNA technology, that express LTBP-2 or 
LTBP-3 gene products. As such, the invention concerns DNA segment comprising an 
isolated gene that encodes a protein or peptide that includes an amino acid sequence 
% essentially as set forth by a contiguous sequence from SEQ ID NO:2 or SEQ ID NO:4. 
U1 These DNA segments are represented by those that include a nucleic acid sequence 

f§5 essentially as set forth by a contiguous sequence from SEQ ID NO:l or SEQ ID NO:3. 

in 

?sr - 

W Competitions that include a purified protein that has an amino acid sequence 

a; 

□ essentially as set forth by the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:4 are 
f * also encompassed by the invention. 

reo 

q The TGF-jSs represent a family of structurally related molecules with diverse effects 

on mammalian cell shape, growth, and differentiation (Roberts and Sporn, 1990). Initially 
synthesized as a precursor consisting of an ammo^errninal propeptide followed by mature 
TGF-/3, two chains of nascent pro-TGF-|3 associate in most tissues to form a Mr ~ 106,000 
25 inactive disulfide-bonded dimer. Homodimers are most common, but heterodimers have also 
been described (Cheifetz et al, 1987; Ogawa et al, 1992). During biosynthesis the mature 
TGF-/3 dimer is cleaved from the propeptide dimer. TGF-/3 latency results in part from the 
non-covalent association of propeptide and mature TGF-/3 dimers (Pircher et al, 1984, 1986; 
Wakefield et al, 1987; Millan et al, 1992; Miyazono and Heldin, 1989). Consequently, the 
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propeptide dimer is often referred to as the latency associated protein (LAP), and LAP plus 
the disulfide-bonded TGF-jS dimer are also known as the small latent complex. In the 
extracellular space small latent complexes must be dissociated to activate mature TGF-/3. 
The mechanism of activation of the latent complex is thought to be one of the most important 
5 steps governing TGF-0 effects (Lyons et al, 1988; Antonelli-Orlidge et al, 1989; Twardzik 
etal, 1990; Sato et al, 1993). 

In certain lines of cultured cells small latent growth factor complexes may contain 
additional high molecular weight proteins. The best characterized of these high molecular 
10 weight proteins is the latent TGF-|3 binding protein, or LTBP (Miyazono et al, 1988; 
Kanzaki et al, 1990; Tsuji et al, 1990; Olofsson et al, 1992; Taketazu et al, 1994). 
LTBP produced by different cell types is heterogeneous in size, perhaps because of 
Jj alternative splicing or because of tissue-specific proteolytic processing (Miyazono et al, 
g 1988; Wakefield et al, 1988; Kanzaki et al, 1990; Tsui et al, 1990). Latent TGF-/3 
ftS complexes that contain LTBP are known as large latent complexes. LTBP has no known 
^ covalent linkage to mature TGF-|6, but rather it is linked by a disulfide bond to LAP. 

O Regarding the novel protein LTBP-2 or LTBP-3, the present invention concerns DNA 

rip 

?V segments, that can be isolated from virtually any mammalian source, that are free from total 
SO genomic DNA and that encode proteins having LrBP-2-like or LTBP-3-like activity. DNA 
p segments encoding LTBP-2-like or LTBP-3-like species may prove to encode proteins, 
polypeptides, subunits, functional domains, and the like. 

As used herein, the term "DNA segment" refers to a DNA molecule that has been 
25 isolated free of total genomic DNA of a particular species. Therefore, a DNA segment 
encoding LTBP-2 or LTBP-3 refers to a DNA segment that contains LTBP-2 or LTBP-3 
coding sequences yet is isolated away from, or purified free from, total genomic DNA of the 
species from which the DNA segment is obtained. Included within the term "DNA 
segment", are DNA segments and smaller fragments of such segments, and also recombinant 
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vectors, including, for example, plasmids, cosmids, phagemids, phage, viruses, and the like. 

Similarly, a DNA segment comprising an isolated or purified LTBP-2 or LTBP-3 
gene refers to a DNA segment including LTBP-2 or LTBP-3 coding sequences and, in 
certain aspects, regulatory sequences, isolated substantially away from other naturally 
occurring genes or protein encoding sequences. In this respect, the term "gene" is used for 
simplicity to refer to a functional protein, polypeptide or peptide encoding unit. As will be 
understood by those in the art, this functional term includes both genomic sequences, cDNA 
sequences and smaller engineered gene segments that express, or may be adapted to express, 
proteins, polypeptides or peptides. 

"Isolated substantially away from other coding sequences" means that the gene of 
interest, in this case, a gene encoding LTBP-2 or LTBP-3, forms the significant part of the 
coding region of the DNA segment, and that the DNA segment does not contain large 
portions of naturally-occurring coding DNA, such as large chromosomal fragments or other 
functional genes or cDNA coding regions. Of course, this refers to the DNA segment as 
originally isolated,_and does not exclude genes or coding regions later added to the segment 
by the hand of man. 

In particular embodiments, the invention concerns isolated DNA segments and 
recombinant vectors incorporating DNA sequences that encode an LTBP-2 or LTBP-3 
species that includes within its amino acid sequence an amino acid sequence essentially as set 
forth in SEQ ID NO:2 or SEQ ID NO:4. In other particular embodiments, the invention 
concerns isolated DNA segments and recombinant vectors incorporating DNA sequences that 
include within their sequence a nucleotide sequence essentially as set forth in SEQ ID NO:l 
or SEQIDNO:3. 

The term "a sequence essentially as set forth in SEQ ID NO:2 or SEQ ID NO:4" 
means that the sequence substantially corresponds to a portion of SEQ ID NO:2 or SEQ ID 
NO:4 and has relatively few amino acids that are not identical to, or a biologically functional 



equivalent of, the amino acids of SEQ ID NO:2 or SEQ ID NO:4. The term "biologically 
functional equivalent" is well understood in the art and is further defined in detail herein (for 
example, see section 7, preferred embodiments). Accordingly, sequences that have between 
about 70% and about 80%; or more preferably, between about 81% and about 90%; or even 
more preferably, between about 91 % and about 99%; of amino acids that are identical or 
functionally equivalent to the amino acids of SEQ ID NO:2 or SEQ ID NO:4 will be 
sequences that are "essentially as set forth in SEQ ID NO:2 or SEQ ID NO:4". 

In certain other embodiments, the invention concerns isolated DNA segments and 
recombinant vectors that include within their sequence a nucleic acid sequence essentially as 
set forth in SEQ ID NO:l or SEQ ID NO:3. The term "essentially as set forth in SEQ ID 
NO: 1 or SEQ ED NO:3" is used in the same sense as described above and means that the 
nucleic acid sequence substantially corresponds to a portion of SEQ ID NO:l or SEQ ID 
NO:3 and has relatively few codons that are not identical, or functionally equivalent, to the 
codons of SEQ ID NO:l or SEQ ID NO:3. Again, DNA segments that encode proteins 
exhibiting LTBP-2-like or LTBP-3-like activity will be most preferred. 

It will also be understood that amino acid and nucleic acid sequences may include 
additional residues, such as additional N- or C-terminal amino acids or 5' or 3' sequences, 
and yet still be essentially as set forth in one of the sequences disclosed herein, so long as 
the sequence meets the criteria set forth above, including the maintenance of biological 
protein activity where protein expression is concerned. The addition of terminal sequences 
particularly applies to nucleic acid sequences that may, for example, include various non- 
coding sequences flanking either of the 5' or 3' portions of the coding region or may include 
various internal sequences, i.e., introns, which are known to occur within genes. 

Naturally, the present invention also encompasses DNA segments that are 
complementary, or essentially complementary, to the sequence set forth in SEQ ID NO:l or 
SEQ ID NO:3. Nucleic acid sequences that are "complementary" are those that are capable 
of base-pairing according to the standard Watson-Crick complementarity rules. As used 
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herein, the term "complementary sequences" means nucleic acid sequences that are 
substantially complementary, as may be assessed by the same nucleotide comparison set forth 
above, or as defined as being capable of hybridizing to the nucleic acid segment of SEQ ID 
NO:l or SEQ ID NO:3, under relatively stringent conditions such as those described herein. 

The nucleic acid segments of the present invention, regardless of the length of the 
coding sequence itself, may be combined with other DNA sequences, such as promoters, 
polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other 
coding segments, and the like, such that their overall length may vary considerably. It is 
therefore contemplated that a nucleic acid fragment of almost any length may be employed, 
with the total length preferably being limited by the ease of preparation and use in the 
intended recombinant DNA protocol. For example, nucleic acid fragments may be prepared 
that include a short contiguous stretch identical to or complementary to SEQ ID NO:l or 
SEQ ID NO:3, such as about 14 nucleotides, and that are up to about 10,000 or about 5,000 
base pairs in length, with segments of about 3,000 being preferred in certain cases. DNA 
segments with total lengths of about 1,000, about 500, about 200, about 100 and about 50 
base pairs in length (including all intermediate lengths) are also contemplated to be useful. 

It will be readily understood that "intermediate lengths", in these contexts, means any 
length between the quoted ranges, such as 14, 15, 16, 17, 18, 19, 20, etc.; 21, 22, 23, etc.; 
30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, etc.; 150, 151, 152, 153, etc.; 
including all integers through the 200-500; 500-1,000; 1,000-2,000; 2,000-3,000; 3,000- 
5,000; 5,000-10,000 ranges, up to and including sequences of about 12,001, 12,002, 13,001, 
13,002 and the like. 

It will also be understood that this invention is not limited to the particular nucleic 
acid and amino acid sequences of SEQ ID NO:l or SEQ ID NO:3; and SEQ ID NO:2 or 
SEQ ID NO:4, respectfully. Recombinant vectors and isolated DNA segments may therefore 
variously include the LTBP-2 or LTBP-3 coding regions themselves, coding regions bearing 
selected alterations or modifications in the basic coding region, or they may encode larger 
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polypeptides that nevertheless include LTBP-2 or LTBP-3-coding regions or may encode 
biologically functional equivalent proteins or peptides that have variant amino acids 
sequences. 

The DNA segments of the present invention encompass biologically functional 
equivalent LTBP-2 or LTBP-3 proteins and peptides. Such sequences may arise as a 
consequence of codon redundancy and functional equivalency that are known to occur 
naturally within nucleic acid sequences and the proteins thus encoded. Alternatively, 
functionally equivalent proteins or peptides may be created via the application of recombinant 
DNA technology, in which changes in the protein structure may be engineered, based on 
considerations of the properties of the amino acids being exchanged. Changes designed by 
man may be introduced through the application of site-directed mutagenesis techniques, e.g., 
to introduce improvements to the antigenicity of the protein or to test mutants in order to 
examine activity at the molecular level. 

If desired, one may also prepare fusion proteins and peptides, e.g. , where the LTBP-2 
or LTBP-3 coding jregions are aligned within the same expression unit with other proteins or 
peptides having desired functions, such as for purification or immunodetection purposes 
{e.g., proteins that may be purified by affinity chromatography and enzyme label coding 
regions, respectively). 

Recombinant vectors form further aspects of the present invention. Particularly useful 
vectors are contemplated to be those vectors in which the coding portion of the DNA 
segment, whether encoding a full length protein or smaller peptide, is positioned under the 
control of a promoter. The promoter may be in the form of the promoter that is naturally 
associated with a LTBP-2 or LTBP-3 gene, as may be obtained by isolating the 5' non- 
coding sequences located upstream of the coding segment or exon, for example, using 
recombinant cloning and/or PCR™ technology, in connection with the compositions disclosed 
herein. 
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In other embodiments, it is contemplated that certain advantages will be gained by 
positioning the coding DNA segment under the control of a recombinant, or heterologous, 
promoter. As used herein, a recombinant or heterologous promoter is intended to refer to a 
promoter that is not normally associated with an LTBP-2 or LTBP-3 gene in its natural 
environment. Such promoters may include LTBP-2 or LTBP-3 promoters normally 
associated with other genes, and/or promoters isolated from any bacterial, viral, eukaryotic, 
or mammaliar cell. Naturally, it will be important to employ a promoter that effectively 
directs the expression of the DNA segment in the cell type, organism, or even animal, 
chosen for expression. The use of promoter and cell type combinations for protein 
expression is generally known to those of skill in the art of molecular biology, for example, 
see Sambrook et ah , 1989. The promoters employed may be constitutive, or inducible, and 
can be used under the appropriate conditions to direct high level expression of the introduced 
DNA segment, such as is advantageous in the large-scale production of recombinant proteins 
or peptides. Appropriate promoter systems contemplated for use in high-level expression 
include, but are not limited to, the Pichia expression vector system (Pharmacia LKB 
Biotechnology). 

In connection with expression embodiments to prepare recombinant LTBP-2 or LTBP- 
3 proteins and peptides, it is contemplated that longer DNA segments will most often be 
used, with DNA segments encoding the entire LTBP-2 or LTBP-3 protein or functional 
domains, subunits, etc. being most preferred. However, it will be appreciated that the use of 
shorter DNA segments to direct the expression of LTBP-2 or LTBP-3 peptides or epitopic 
core regions, such as may be used to generate anti- LTBP-2 or LTBP-3 antibodies, also falls 
within the scope of the invention. DNA segments that encode peptide antigens from about 
15 to about 50 amino acids in length, or more preferably, from about 15 to about 30 amino 
acids in length are contemplated to be particularly useful. 

The LTBP-2 or LTBP-3 gene and DNA segments may also be used in connection 
with somatic expression in an animal or in the creation of a transgenic animal. Again, in 
such embodiments, the use of a recombinant vector that directs the expression of the Ml 
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length or active LTBP-2 or LTBP-3 protein is particularly contemplated. 

In addition to their use in directing the expression of the LTBP-2 or LTBP-3 protein, 
the nucleic acid sequences disclosed herein also have a variety of other uses. For example, 
they also have utility as probes or primers in nucleic acid hybridization embodiments. As 
such, it is contemplated that nucleic acid segments that comprise a sequence region that 
consists of at least a 14 nucleotide long contiguous sequence that has the s^me sequence as, 
or is complementary to, a 14 nucleotide long contiguous sequence of SEQ ID NO:l or SEQ 
ID NO:3 will find particular utility. Longer contiguous identical or complementary 
sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 (including all 
intermediate lengths) and even up to full length sequences will also be of use in certain 
embodiments. 

The ability of such nucleic acid probes to specifically hybridize to LTBP-2 or LTBP- 
3-encoding sequences will enable them to be of use in detecting the presence of 
complementary sequences in a given sample. However, other uses are envisioned, including 
the use of the sequence information for the preparation of mutant species primers, or primers 
for use in preparing other genetic constructions. 

Nucleic acid molecules having sequence regions consisting of contiguous nucleotide 
stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides or so, identical or 
complementary to SEQ ID NO:l or SEQ ID NO: 3, are particularly contemplated as 
hybridization probes for use in, e.g., Southern and Northern blotting. This would allow 
LTBP-2 or LTBP-3 structural or regulatory genes to be analyzed, both in diverse cell types 
and also in various mammalian cells. The total size of fragment, as well as the size of the 
complementary stretch(es), will ultimately depend on the intended use or application of the 
particular nucleic acid segment. Smaller fragments will generally find use in hybridization 
embodiments, wherein the length of the contiguous complementary region may be varied, 
such as between about 10-14 and about 100 nucleotides, but larger contiguous 
complementarity stretches may be used, according to the length complementary sequences 
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one wishes to detect. 



The use of a hybridization probe of about 10-14 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having 
contiguous complementary sequences over stretches greater than 10 bases in length are 
generally preferred, though, in order to increase stability and selectivity of the hybrid, and 
therebv improve the quality and degree of specific hybrid molecules obtained. One will 
generally prefer to design nucleic acid molecules having gene-complementary stretches of 15 
to 20 contiguous nucleotides, or even longer where desired. 

Hybridization probes may be selected from any portion of any of the sequences 
disclosed herein. All that is required is to review the sequence set forth in SEQ ID NO:l or 
SEQ ID NO:3 and to select any continuous portion of the sequence, from about 10-14 
nucleotides in length up to and including the full length sequence, that one wishes to utilize 
as a probe or primer. The choice of probe and primer sequences may be governed by 
various factors, such as, by way of example only, one may wish to employ primers from 
towards the termini of the total sequence. 

The process of selecting and preparing a nucleic acid segment that includes a 
contiguous sequence from within SEQ ID NO:l or SEQ ID NO:3 may alternatively be 
described as preparing a nucleic acid fragment. Of course, fragments may also be obtained 
by other techniques such as, e.g., by mechanical shearing or by restriction enzyme digestion. 
Small nucleic acid segments or fragments may be readily prepared by, for example, directly 
synthesizing the fragment by chemical means, as is commonly practiced using an automated 
oligonucleotide synthesizer. Also, fragments may be obtained by application of nucleic acid 
reproduction technology, such as the PCR™ technology of U.S. Patent 4,603,102 
(incorporated herein by reference), by introducing selected sequences into recombinant 
vectors for recombinant production, and by other recombinant DNA techniques generally 
known to those of skill in the art of molecular biology. 
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Accordingly, the nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of LTBP-2 or LTBP-3 gene 
or cDNA fragments. Depending on the application envisioned, one will desire to employ 
varying conditions of hybridization to achieve varying degrees of selectivity of probe towards 
target sequence. For applications requiring high selectivity, one will typically desire to 
employ relatively stringent conditions to form the hybrids, e.g., one will select relatively low 
salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.15 M 
NaCl at temperatures of 50°C to 70°C. Such selective conditions tolerate little, if any, 
mismatch between the probe and the template or target strand, and would be particularly 
suitable for isolating LTBP-2 or LTBP-3 genes. 

Of course, for some applications, for example, where one desires to prepare mutants 
employing a mutant primer strand hybridized to an underlying template or where one seeks 
to isolate LTBP-2 or LTBP-2 or LTBP-3-encoding sequences from related species, functional 
equivalents, or the like, less stringent hybridization conditions will typically be needed in 
order to allow formation of the heteroduplex. In these circumstances, one may desire to 
employ conditions such as about 0.15 M to about 0.9 M salt, at temperatures ranging from 
20 °C to 55 °C. Cross-hybridizing species can thereby be readily identified as positively 
hybridizing signals with respect to control hybridizations. In any case, it is generally 
appreciated that conditions can be rendered more stringent by the addition of increasing 
amounts of formamide, which serves to destabilize the hybrid duplex in the same manner as 
increased temperature. Thus, hybridization conditions can be readily manipulated, and thus 
will generally be a method of choice depending on the desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences of 
the present invention in combination with an appropriate means, such as a label, for 
determining hybridization. A wide variety of appropriate indicator means are known in the 
art, including fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, 
which are capable of giving a detectable signal. In preferred embodiments, one will likely 
desire to employ a fluorescent label or an enzyme tag, such as urease, alkaline phosphatase 
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or peroxidase, instead of radioactive or other environmental undesirable reagents. In the 
case of enzyme tags, colorimetric indicator substrates are known that can be employed to 
provide a means visible to the human eye or spectrophotometrically, to identify specific 
hybridization with complementary nucleic acid-containing samples. 

In general, it is envisioned that the hybridization probes described herein will be 
useful both as reagents in solution hybridization as well as in embodiments employing a solid 
phase. In embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or 
otherwise affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is 
then subjected to specific hybridization with selected probes under desired conditions. The 
selected conditions will depend on the particular circumstances based on the particular 
criteria required (depending, for example, on the G+C content, type of target nucleic acid, 
source of nucleic acid, size of hybridization probe, etc.). Following washing of the 
hybridized surface so as to remove nonspecifically bound probe molecules, specific 
hybridization is detected, or even quantitated, by means of the label. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood 
by reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 

FIG. 1. Overlapping murine cDNA clones representing the LTBP-like (LTBP-2 or 
LTBP-3) sequence. A partial representation of restriction sites is shown. N, Ncol\ P, Pvull; 
R, Rsall; B, BamEl; H, HindllL The numbering system at the bottom assumes that the M A" 
of the initiator Met codon is nt #1 . 

FIG. 2 A. A schematic showing the structure of the murine fibrillin- 1 gene product. 
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Structural domains are shown below the diagram. Symbols designating various structural 
elements are defined in the legend to FIG. 2B. 

FIG. 2B. A schematic showing the structure of the murine LTBP-like (LTBP-2 or 
LTBP-3) molecule. Domains #1-5 are denoted below the diagram. Symbols designate the 
following structural elements: EGF-CB repeats: open rectangles; TGF-bp repeats: open 
ovals; Fib motif: open circle; TGF-bp-like repeat: patterned oval; cysteine-rich sequences: 
patterned rectangles; proline/glycine-rich region: thick curved line, domain #2; proline-rich 
region, thick curved line, domain #3. Note that symbols designating the signal peptide have 
been deleted for simplicity. Additionally, the schematic assumes that EGF-like and EGF-CB 
repeats may extend for several amino acids beyond the C 6 position. 

FIG, 2C. A schematic showing the structure of human LTBP-1. Domains #1-5 are 
denoted below the diagram. The symbols designating the structural elements are defined in 
the legend to FIG. 2B. 

FIG. 3A, FIG. 3B, FIG. 3C, FIG. 3D, FIG. 3E and FIG. 3F. Overview of 
expression of the new LTBP-like (LTBP-2 or LTBP-3) gene during murine development as 
determined by tissue in situ hybridization. FIG. 3A, FIG. 3B, FIG. 3C, FIG. 3D, FIG. 3E 
and FIG. 3F are autoradiograms made by direct exposure of tissue sections to film after 
hybridization with radiolabeled probes. Identical conditions were maintained throughout 
autoradiography and photography, thereby allowing a comparison of the overall strength of 
hybridization in all tissue sections. The transcript is expressed in connective tissue, 
mesenchyme, liver, heart and CNS. 

FIG. 3A. Day 8.5-9.0; sections contain embryos surrounded by intact membranes, 
uterine tissues, and the placental disk, cut in random planes; anti-sense probe. 

FIG. 3B. Day 8.5-9.0; sections contain embryos surrounded by intact membranes, 
uterine tissues, and the placental disk, cut in random planes; sense probe. 
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FIG, 3C. Day 13.5; sections contain isolated whole embryos sectioned in the sagittal 
plane near or about the mid-line; anti-sense probe. 

FIG. 3D. Day 13.5; sections contain isolated whole embryos sectioned in the sagittal 
plane near or about the mid-line; sense probe. 

FIG. 3E. Day 16.5; sections contain isolated whole embryos sectioned in the sagittal 
plane near or about the mid-line, anti-sense probe. 

FIG. 3F. Day 16.5; sections contain isolated whole embryos sectioned in the sagittal 
plane near or about the mid-line, sense probe. 

FIG. 4. Time-dependent expression of the LTBP-3 gene by MC3T3-E1 cells. 
mRNA preparation and Northern blotting were preformed as described in Example XIV. 
Equal aliquots of total RNA as determined by UV spectroscopy were loaded in each lane of 
the Northern gel. As demonstrated by UV spectroscopy were loaded in each lane of the 
Northern gel. As demonstrated by methylene blue staining (Sambrook et aL, 1989), equal 
amounts of RNA were transferred to the nylon membrane. The results demonstrate a clear, 
strong peak in LTBP-3 gene expression by 14 days in culture. Weaker signals denoting 
LTBP-3 gene expression also can be observed after 5 days and 28 days in culture. 

FIG. 5. Antisera #274 specifically binds LTBP-3 epitopes. Transfection of 293T 
cells with a full length mouse LTBP-3 expression plasmid followed by radiolabeling, 
preparation of medium sample, immunoprecipitation, and 4-18% gradient SDS-PAGE were 
performed as described in Example XIV. The figure presents a SDS-PAGE autoradiogram 
of medium samples following a 2 day exposure to film. Lane assignments are as follows: 
Lane 1, radiolabeled 293T medium (prior to transfection) immunoprecipitated with 
preimmune serum; Land 2, radiolabeled 293T medium (prior to transfection) 
immunoprecipitated with antibody #274; Lane 3, radiolabeled 393T medium (following 
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transfection and preincubation with 10 ng of LTBP-3 synthetic peptide cocktail) 
immunoprecipitated with antibody #274; and Lane 4, radiolabeled 293T medium (following 
transfection) immunoprecipitated with antibody #274. As indicated by the bar, the full length 
LTBP-3 molecule migrated at 180-190 kDa. 

FIG. 6. Co-immunoprecipitation of LTBP-3 and TGR-01 produced by MC3T3-E1 
cells. Aliquots ( ~ 10 6 incorporated CPM) of radiolabeled media produced by MC3T3-E1 
cells after 7 days in culture were immunoprecipitated as described in Example XTV. Bars 
indicate the position of cold molecular weight standards used to estimate molecular weight 
(Rainbow mix, Amersham). Immunoprecipitates were separated using 4%-18% gradient 
SDS-PAGE and reducing conditions. The figure shows a negative control lane 1 consisting 
of MC3T3-E1 medium immunoprecipitated with anti-LTBP-3 antibody #274. Western 
blotting was performed using the lower portion of the gradient gel and a commercially 
available antibody to TGF-jSl (Santa Cruz Biotechnology, Inc.). Antibody staining was 
detected using commercially available reagents and protocols (ECL Western Blotting 
Reagent, Amersham). MC3T3-E1 medium was immunoprecipitated with anti-LTBP-2 
antibody #274. 

FIG. 7. DNA sequence of the murine LTBP-2 gene (SEQ ID NO:l). 

FIG. 8. Amino acid sequence of the murine LTBP-2 gene product (SEQ ID NO:2). 

FIG. 9. DNA sequence of the murine LTBP-3 gene (SEQ ID NO:3). 

FIG. 10. Amino acid sequence of the murine LTBP-3 gene product (SEQ ID NO:4). 

FIG. tl. Mouse ltbp-2 cDNA Clones. The schematic figure presents overlapping 
mouse cDNA clones representing the mouse ltbp-2 coding sequence. A partial representation 
of the restriction sites is shown. A, Avrll; N, Nael; Sa, ScaU; X, Xhol; B, BaniSl; C, Clal; 
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and E, EcoBI. 



FIG. 12A. ltbp-2 Gene Expression in the Developing Mouse Skeleton. Shown is an 
overview of the of the ltbp-2 gene during mouse development, as determine by tissue in situ 
hybridization. The figure presents an autoradiogram made by direct exposure of tissue 
sections to film after hybridization with radiolabeled probes, but before dipping slides in 
radiographic emulsion. Day 16.5 p.c. sections contain whole embryos sectioned in the mid- 
sagittal plane. Identical conditions were maintained throughout autoradiography and 
photography, making it possible to compare the overall strength of hybridization with 
antisense (top) and sense (bottom) probes. 1 cm = 20 /xm. 

FIG, 12B. ltbp-2 Gene Expression in the Developing Mouse Skeleton. Shown is a 
selected brightfield microscopic view of mouse ltbp-2 gene expression in cartilage of day 
16.5 p.c. mouse embryos. Photograph was taken from tissue sections following a two week 
exposure to photographic emulsion. 1 cm = 20 jim. 

FIG. liC. ltbp-2 Gene Expression in the Developing Mouse Skeleton. Shown is a 
selected darkfield microscopic view of mouse ltbp-2 gene expression in cartilage of day 16.5 
p.c. mouse embryos. Photograph was taken from tissue sections following a two week 
exposure to photographic emulsion. In all darkfield photographs red blood cell and other 
plasma membranes give a faint white signal that contributes to the background of the study. 
1 cm = 20 fim. 

FIG. / 13. Co-transfection of 293T Cells With pLTBP-3fl and pTGF-/31. 
Immunoprecipitation of LTBP-3 and TGF-|S1 produced by 293T cells following transient 
transfection and radiolabeling. Aliquots (~ 10 6 incorporated CPM) of radiolabeled media 
were immunoprecipitated and separated using 4%-18% gradient SDS-PAGE and either 
reducing or nonreducing conditions as described (Yin et al., 1995). Bars on the left indicate 
the position of cold standards used to estimate molecular weight: 200, 97.4, 69, 46, 30, 
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21.5 and 14.3 kDa (Rainbow mix, Amersham). Lane assignments are as follows: Lane 1, 
293T cell medium proteins immunoprecipitated with 274 antibody (separated under reducing 
conditions) after co-transfection with pLTBP-3fl and pTGF-jSl; Lane 2, 293T cell medium 
proteins immunoprecipitated with 274 antibody (separated under nonreducing conditions) 
after co-transfection with pLTBP-3fl and pTGF-jSl; Lane 3, untransfected 293T cell medium 
proteins imirianGprecipitated with 274 antibody (separated under reducing conditions); Lane 
4, 293T cell medium proteins immunoprecipitated vith 274 antibody (sep?^ated under 
reducing conditions) after transfection with pLTBP-3fl; Lane 5, 293T cell medium proteins 
immunoprecipitated with 40091 antibody (separated under reducing conditions) after 
transfection with pTGF-jSl; Lane 6, 293T cell medium proteins immunoprecipitated with 
40091 antibody (separated under nonreducing conditions) after transfection with pTGF-j31; 
and Lane 7, 293T cell medium proteins immunoprecipitated with 40091 antibody (separated 
under reducing conditions) after co-transfection with pLTBP-3fl and pTGF-j31. Note that the 
signal is weakest in lanes 5-7, in which proteins were immunoprecipitated using the 40091 
antibody, reflecting the weaker affinity of the 40091 antiserum. 



DESCRIPTION OF THE PREFERRED EMBODIMENT 



1. Applications of Bone Repair Technology to Human Treatment 

The following is a brief discussion of four human conditions to exemplify the variety 
of diseases and disorders that would benefit from the development of new technology to 
improve bone repair and healing processes. In addition to the following, several other 
conditions, such as, for example, vitamin D deficiency; wound healing in general; soft 
skeletal tissue repair; and cartilage and tendon repair and regeneration, may also benefit from 
technology concerning the stimulation of bone progenitor cells. 

The first example is the otherwise healthy individual who suffers a fracture. Often, 
clinical bone fracture is treated by casting to alleviate pain and allow natural repair 
mechanisms to repair the wound. While there has been progress in the treatment of fracture 
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in recent times, even without considering the various complications that may arise in treating 
fractured bones, any new procedures to increase bone healing in normal circumstances would 
represent a great advance. 

A second example which may benefit from new treatment methods is osteogenesis 
imperfecta (01). 01 encompasses various inherited connective tissue diseases that involve 
bone ?nd soft connective tissue fragility in humans (3yers and Steiner, 1992; Prockop, 
1990). About one child per 5,000-20,000 born is affected with 01 and the disease is 
associated with significant morbidity throughout life. A certain number of deaths also occur, 
resulting in part from the high propensity for bone fracture and the deformation of abnormal 
bone after fracture repair (01 types II-IV; Bonadio and Goldstein, 1993). The relevant issue 
here is quality of life; clearly, the lives of affected individuals would be improved by the 
development of new therapies designed to stimulate and strengthen the fracture repair 
process. 

OI type I is a mild disorder characterized by bone fracture without deformity, blue 
sclerae, normal or near normal stature, and autosomal dominant inheritance (Bonadio and 
Goldstein, 1993). Osteopenia is associated with an increased rate of lone bone fracture upon 
ambulation (the fracture frequency decreases dramatically at puberty and during young adult 
life, but increases once again in late middle age). Hearing loss, which often begins in the 
second or third decade, is a feature of this disease in about half the families and can progress 
despite the general decline in fracture frequency. Dentinogenesis imperfecta is observed in a 
subset of individuals. 

In contrast, 01 types II- VI represent a spectrum of more severe disorders associated 
with a shortened life-span. 01 type II, the perinatal lethal form, is characterized by short 
stature, a soft calvarium, blue sclerae, fragile skin, a small chest, floppy appearing lower 
extremities (due to external rotation and abduction of the femurs), fragile tendons and 
ligaments, bone fracture with severe deformity, and death in the perinatal period due to 
respiratory insufficiency. Radiographic signs of bone weakness include compression of the 
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femurs, bowing of the tibiae, broad and beaded ribs, and calvarial thinning. 

01 type III is characterized by short stature, a triangular facies, severe scoliosis, and 
bone fracture with moderate deformity. Scoliosis can lead to emphysema and a shortened 
life-span due to respiratory insufficiency. 01 type IV is characterized by normal sclerae, 
bone fracture with mild to moderate deformity, tooth defects, and a natural history that 
essentially is intermediate between 01 type II and 01 type L 

More than 200 OI mutations have been characterized since 1989 (reviewed in Byers 
and Steiner, 1992; Prockop, 1990). The vast majority occur in the COL1A1 and COL1A2 
genes of type I collagen. Most cases of 01 type I appear to result from heterozygous 
mutations in the C0L1A1 gene that decrease collagen production but do not alter primary 
structure, i.e., heterozygous null mutations affecting C0L1A1 expression. Most cases of 01 
types II-IV result from heterozygous mutations in the C0L1A1 and C0L1A2 genes that alter 
the structure of collagen. 

A third important example is osteoporosis. The term osteoporosis refers to a 
heterogeneous group of disorders characterized by decreased bone mass and fractures. An 
estimated 20-25 million people are at increased risk for fracture because of site-specific bone 
loss. Risk factors for osteoporosis include increasing age, gender (more females), low bone 
mass, early menopause, race (Caucasians), low calcium intake, reduced physical activity, 
genetic factors, environmental factors (including cigarette smoking and abuse of alcohol or 
caffeine), and deficiencies in neuromuscular control that create a propensity to fall. 

More than a million fractures in the USA each year can be attributed to osteoporosis, 
and in 1986 alone the treatment of osteoporosis cost an estimated 7-10 billion health care 
dollars. Demographic trends (i.e., the gradually increasing age of the US population) 
suggest that these costs may increase 2-3 fold by the year 2020 if a safe and effective 
treatment is not found. Clearly, osteoporosis is a significant health care problem. 
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Clinically, osteoporosis is segregated into type I and type IL Type I osteoporosis 
occurs predominantly in middle aged women and is associated with estrogen loss at the 
menopause, while osteoporosis type II is associated with advancing age. Much of the 
morbidity and mortality associated with osteoporosis results from immobilization of elderly 
patients following fracture. 

Current therapies for osteoporosis patients focus on fracture prevention, not fracture 
repair. This remains an important consideration because of the literature, which clearly 
states that significant morbidity and mortality are associated with prolonged bed rest in the 
elderly, particularly those who have suffered hip fractures. Complications of bed rest 
include blood clots and pneumonia. These complications are recognized and measures are 
usually taken to avoid them, but these measures hardly represent the best approach to 
therapy. Thus, the osteoporotic patient population would benefit from new therapies 
designed to strengthen bone and speed up the fracture repair process, thereby getting these 
people on their feet before the complications arise. 

A fourth example is related to bone reconstruction and, specifically, the ability to 
reconstruct defects in bone tissue that result from traumatic injury; cancer or cancer surgery; 
birth defect; a developmental error or heritable disorder; or aging. There is a significant 
orthopaedic need for more stable total joint implants, and cranial and facial bone are 
particular targets for this type of reconstructive need. The availability of new implant 
materials, e.g., titanium, has permitted the repair of relatively large defects. Titanium 
implants provide excellent temporary stability across bony defects. However, experience has 
shown that a lack of viable bone bridging the defect can result in exposure of the appliance, 
infection, structural instability and, ultimately, failure to repair the defect. 

Autologous bone grafts are another possible reconstructive modality, but they have 
several demonstrated disadvantages in that they must be harvested from a donor site such as 
iliac crest or rib, they usually provide insufficient bone to completely fill the defect, and the 
bone that does form is sometimes prone to infection and resorption. Partially purified 
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xenogeneic preparations are not practical for clinical use because microgram quantities are 
purified from kilograms of bovine bone, making large scale commercial production both 
costly and impractical. Allografts and demineralized bone preparations are therefore often 
employed. 

Microsurgical transfers of free bone grafts with attached soft tissue and blood vessels 
can close bony defects with an immediate source of blood supply to the graft. However, 
these techniques are time consuming, have been shown to produce a great deal of morbidity, 
and can only be used by specially trained individuals. Furthermore, the bone implant is 
often limited in quantity and is not readily contoured. In the mandible, for example, the 
majority of patients cannot wear dental appliances using presently accepted techniques (even 
after continuity is established), and thus gain little improvement in the ability to masticate. 
Toriumi et al , have written that, "reconstructive surgeons should have at their disposal a 
bone substitute that would be reliable, biocompatible, easy to use, and long lasting and that 
would restore mandibular continuity with little associated morbidity." 

In connection with bone reconstruction, specific problem areas for improvement are 
those concerned with treating large defects, such as created by trauma, birth defects, or 
particularly, following tumor resection. The success of orthopaedic implants, interfaces and 
artificial joints could conceivably be improved if the surface of the implant, or a functional 
part of an implant, were to be coated with a bone stimulatory agent. The surface of implants 
could be coated with one or more appropriate materials in order to promote a more effective 
interaction with the biological site surrounding the implant and, ideally, to promote tissue 
repair. 

2. Bone Repair 

Bone tissue is known to have the capacity for repair and regeneration and there is a 
certain understanding of the cellular and molecular basis of these processes. The initiation of 
new bone formation involves the commitment, clonal expansion, and differentiation of 
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progenitor cells. Once initiated, bone formation is promoted by a variety of polypeptide 
growth factors, Newly formed bone is then maintained by a series of local and systemic 
growth and differentiation factors. 

The concept of specific bone growth-promoting agents is derived from the work of 
Huggins and Urist. Huggins et al. , 1936, demonstrated that autologous transplantation of 
canine incisor tooth to skeletal muscle resulted in local new bone formation (Huggins et al. , 
1936). Urist and colleagues reported that demineralized lyophilized bone segments induced 
bone formation (Urist, 1965; Urist et al, 1983), a process that involved macrophage 
chemotaxis; the recruitment of progenitor cells; the formation of granulation tissue, cartilage, 
and bone; bone remodeling; and marrow differentiation. The initiation of cartilage and bone 
formation in an extraskeletal site, a process referred to as osteoinduction, has permitted the 
unequivocal identification of initiators of bone morphogenesis (Urist, 1965; Urist et al. , 
1983; Sampath et al, 1984; Wang et al, 1990; Cunningham et al, 1992). 

Significant progress has now been made in characterizing the biological agents 
elaborated by active bone tissue during growth and natural bone healing. Demineralized 
bone matrix is highly insoluble; Sampath and Reddi (1981) showed that only 3% of the 
proteins can be extracted using strong combinations of denaturants and detergents. They also 
showed that the unfractionated demineralized bone extract will initiate bone morphogenesis, a 
critical observation that led to the purification of "osteoinductive" molecules. Families of 
proteinaceous osteoinductive factors have now been purified and characterized. They have 
been variously referred to in the literature as bone morphogenetic or morphogenic proteins 
(BMPs), osteogenic bone inductive proteins or osteogenic proteins (OPs). 

3. Bone Repair and Growth Factors and Cytokines 

Following their initial purification, several bone morphogenetic protein genes have 
now been cloned using molecular techniques (Wozney et al. , 1988; Rosen et al , 1989; 
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summarized in Alper, 1994). This work has established BMPs as members of the 
transforming growth factor-/? (TGF-/3) superfamily based on DNA sequence homologies. 
Other TGF molecules have also been shown to participate in new bone formation, and TGF- 
0 is regarded as a complex multifunctional regulator of osteoblast function (Centrella et al , 
1988; Carrington et al, 1988; Seitz et al, 1992). Indeed, the family of transforming growth 
factors (TGF-/31, TGF-02, and TGF-/33) has been proposed as potentially useful in the 
treatment of bone disease (U.S. Patent 5,125,978, incorporated herein by reference). 

Transforming growth factors (TGFs) have a central role in regulating tissue healing 
by affecting cell proliferation, gene expression, and matrix protein synthesis (Roberts and 
Sporn, 1989). While not necessarily a direct effect, evidence has been provided that TGF-/81 
and TGF-/32 can initiate both chondrogenesis and osteogenesis (Joyce et al, 1990; Izumi et 
al, 1992; Jingushi et al. , 1992). In these studies new cartilage and bone formation appeared 
to be dose dependent {i.e. , dependent on the local growth factor concentration). The data 
also suggested that TGF-jSl and TGF-02 stimulated cell differentiation by a similar 
mechanism, even though they differed in terms of the ultimate amount of new cartilage and 
bone that was formed. 

Other growth factors/hormones besides TGF and BMP may influence new bone 
formation following fracture. Bolander and colleagues injected recombinant acidic fibroblast 
growth factor into a rat fracture site (Jingushi et al , 1990). The major effect of multiple 
high doses (1.0 mg/50 ml) was a significant increase in cartilage tissue in the fracture gap, 
while lower doses had no effect. These investigators also used the reverse transcriptase- 
polymerase chain reaction (PCR™) technique to demonstrate expression of estrogen receptor 
transcripts in callus tissue (Boden et al, 1989). These results suggested a role for estrogen 
in normal fracture repair. 

Horowitz and colleagues have shown that activated osteoblasts will synthesize the 
cytokine, macrophage colony stimulating factor (Horowitz et al, 1989). The osteotropic 
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agents used in this study included lipopolysaccharide, PTH1-84, PTH1-34, vitamin D and 
all-trans retinoic acid. This observation has led to the suggestion that osteoblast activation 
following fracture may lead to the production of cytokines that regulate both hematopoiesis 
and new bone formation. Various other proteins and polypeptides that have been found to be 
expressed at high levels in osteogenic cells, such as, e.g., the polypeptide designated Vgr-1 
(Lyons et al. , 1989), also have potential for use in connection with the present invention. 

4. Protein Administration and Bone Repair 

Several studies have been conducted in which preparations of protein growth factors, 
including BMPs, have been administered to animals in an effort to stimulate bone growth. 
The results of four such exemplary studies are described blow. 

Toriumi et al. , studied the effect of recombinant BMP-2 on the repair of surgically 
created defects in the mandible of adult dogs (Toriumi et al. , 1991). Twenty-six adult 
hounds were segregated into three groups following the creation of a 3 cm full thickness 
mandibular defect: 12 animals received test implants composed of inactive dog bone matrix 
carrier and" human BMP-2, 10 animals received control implants composed of carrier without 
BMP-2, and BMP-4 animals received no implant. The dogs were euthanized at 2.5-6 
months, and the reconstructed segments were analyzed by radiography, histology, 
histomorphometry, and biomechanical testing. Animals that received test implants were 
euthanized after 2.5 months because of the presence of well mineralized bone bridging the 
defect. The new bone allowed these animals to chew a solid diet, and the average bending 
strength of reconstructed mandibles was 27% of normal ('normal' in this case represents the 
unoperated, contralateral hemimandible). In contrast, the implants in the other two groups 
were non-functional even after 6 months and showed niinimal bone formation. 

Yasko et al. , published a related study in which the effect of BMP-2 on the repair of 
segmental defects in the rat femur was examined (Yasko et al. , 1992). The study design 
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included a group that received a dose of 1.4 rug of BMP-2, another group that received 11.0 
mg of BMP-2, and a control group that received carrier matrix alone. Endochondral bone 
formation was observed in both groups of animals that received BMP-2. As demonstrated by 
radiography, histology, and whole bone (torsion) tests of mechanical integrity, the larger 
dose resulted in functional repair of the 5-mm defect beginning 4.5 weeks after surgery. The 
lower dose resulted in radiographic and histological evidence of new bone formation, but 
functional union was not observed even a*ter 9 weel:s post surgery. There was also no 
evidence of bone formation in control animals at this time. 

Chen et al., showed that a single application of 25-100 mg of recombinant TGF-01 
adjacent to cartilage induced endochondral bone formation in the rabbit ear full thickness skin 
wounds (Chen et al. , 1991). Bone formation began 21 days following the creation of the 
wound and reached a peak at day 42, as demonstrated by morphological methods. Active 
bone remodeling was observed beyond this point. 

In a related study, Beck et al, demonstrated that a single application of TGF-01 in a 
3% methylcellulose gel was able to repair surgically induced large skull defects that 
otherwise heal by fibrous connective tissue and never form bone (Beck et al, 1991). Bony 
closure was achieved within 28 days of the application of 200 mg of TGF-01 and the rate of 
healing was shown to be dose dependent. 

Studies such as those described above have thus established that exogenous growth 
factors can be used to stimulate new bone formation/repair/regeneration in vivo. Certain 
U.S. Patents also concern methods for treating bone defects or inducing bone formation. For 
example, U.S. Patent 4,877,864 relates to the administration of a therapeutic composition of 
bone inductive protein to treat cartilage and/or bone defects; U.S. Patent 5,108,753 concerns 
the use of a device containing a pure osteogenic protein to induce endochondral bone 
formation and for use in periodontal, dental or craniofacial reconstructive procedures. 
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5. Recombinant Expression 

The use of recombinant expression systems in the preparation of LTBP-2 and LTBP-3 
polypeptides is particularly contemplated. To express a recombinant LTBP-2 and LTBP-3 
polypeptides, whether mutant or wild-type, in accordance with the present invention one 
would prepare an expression vector that comprises an LTBP-2- or LTBP-3-encoding nucleic 
acid segment un^er the control of one or more promoters. The "upstream" promoters 
stimulate transcription of the DNA and promote expression of the encoded recombinant 
protein. This is the meaning of "recombinant expression" in this context. 

Many standard techniques are available to construct expression vectors containing the 
appropriate nucleic acids and transcriptional/translational control sequences in order to 
achieve protein or peptide expression in a variety of host-expression systems. Cell types 
available for expression include, but are not limited to, bacteria, such as E. coli and 
B. subtilis transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors. 

Certain examples of prokaryotic hosts are E. coli strain RR1, E. coli LE392, 
E. coli B, E. coli X 1776 (ATCC No. 31537) as well as E. coli W3110 (F-, lambda-, 
prototrophic, ATCC No. 273325); bacilli such as Bacillus subtilis; and other 
enterobacteriaceae such as Salmonella typhimurium, Serratia marcescens, and various 
Pseudomonas species. 

In general, plasmid vectors containing replicon and control sequences which are 
derived from species compatible with the host cell are used in connection with these hosts. 
The vector ordinarily carries a replication site, as well as marking sequences which are 
capable of providing phenotypic selection in transformed cells. For example, E. coli is often 
transformed using pBR322, a plasmid derived from an E. coli species. pBR322 contains 
genes for ampicillin and tetracycline resistance and thus provides easy means for identifying 
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transformed cells. The pBR plasmid, or other microbial plasmid or phage must also contain, 
or be modified to contain, promoters which can be used by the microbial organism for 
expression of its own proteins. 

In addition, phage vectors containing replicon and control sequences that are 
compatible with the host microorganism can be used as traiisforming vectors in connection 
with these hosts. For example, the phage lambda GEM-11™ may be utilized in making a 
recombinant phage vector which can be used to transform host cells, such as E. colt LE392. 
Further useful vectors include pIN vectors (Inouye et al, 1985); and pGEX vectors, for use 
in generating glutathione S-transferase (GST) soluble fusion proteins for later purification and 
separation or cleavage. 

Promoters that are most commonly used in recombinant DNA construction include the 
/3-lactamase (peniciUinase), lactose and tryptophan (trp) promoter systems. While these are 
the most commonly used, other microbial promoters have been discovered and utilized, and 
details concerning their nucleotide sequences have been published, enabling those of skill in 
the art to ligate them functionally with plasmid vectors. 

For expression in Saccharomyces, the plasmid YRp7, for example, is commonly used 
(Stinchcomb et al, 1979; Kingsman et al, 1979; Tschemper et al, 1980). This plasmid 
already contains the trpL gene which provides a selection marker for a mutant strain of yeast 
lacking the ability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1 (Jones, 
1977). The presence of the trpL lesion as a characteristic of the yeast host cell genome then 
provides an effective environment for detecting transformation by growth in the absence of 
tryptophan. 

Suitable promoting sequences in yeast vectors include the promoters for 3- 
phosphoglycerate kinase (Hitzeman et al, 1980) or other glycolytic enzymes (Hess et al, 
1968; Holland et al, 1978), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, 
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hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase, and glucokinase. In constructing suitable expression plasmids, the termination 
sequences associated with these genes are also ligated into the expression vector 3' of the 
sequence desired to be expressed to provide polyadenylation of the mRNA and termination. 

Other suitable promoters, which have the additional advantage of transcription 
controlled by growth conditions, include the promoter region for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen 
metabolism, and the aforementioned glyceraldehyde-3-phosphate dehydrogenase, and 
enzymes responsible for maltose and galactose utilization. 

In addition to micro-organisms, cultures of cells derived from multicellular organisms 
may also be used as hosts. In principle, any such cell culture is workable, whether from 
vertebrate or invertebrate culture. In addition to mammalian cells, these include insect cell 
systems infected with recombinant virus expression vectors (e.g., baculovirus); and plant cell 
systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression 
vectors (e.g., Ti plasmid) containing one or more LTBP-2- or LTBP-3- encoding DNA 
sequences. 

In a useful insect system, Autograph californica nuclear polyhidrosis virus (AcNPV) 
is used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells 
The LTBP coding sequences are cloned into non-essential regions (for example the 
polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for example 
the polyhedrin promoter). Successful insertion of the coding sequences results in the 
inactivation of the polyhedrin gene and production of non-occluded recombinant virus (i.e., 
virus lacking the proteinaceous coat coded for by the polyhedrin gene). These recombinant 
viruses are then used to infect Spodoptera frugiperda cells in which the inserted gene is 
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expressed (e.g., U.S. Patent No. 4,215,051 issued to Smith). 



Examples of useful mammalian host cell lines are VERO and HeLa cells, Chinese 
hamster ovary (CHO) cell lines, W138, BHK, COS-7, 293, HepG2, 3T3, RIN and MDCK 
cell lines. In addition, a host cell strain may be chosen that modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion 
desired. Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein 
products may be important for the function of the protein, particularly with respect to the 
ability of LTBP-2 and LTBP-3 polypeptides to bind to TGF/3 proteins. 

Different host cells have characteristic and specific mechanisms for the post- 
translational processing and modification of proteins. Appropriate cells lines or host systems 
can be chosen to ensure the correct modification and processing of the foreign protein 
expressed. To this end, eukaryotic host cells may also be used if desired, with a cell that 
allows for high-level expression of LTBP-2 and LTBP-3 polypeptides being preferred. 

Expression vectors for use in mammalian such cells ordinarily include an origin of 
replication (as necessary), a promoter located in front of the gene to be expressed, along with 
any necessary ribosome binding sites, RNA splice sites, polyadenylation site, and 
transcriptional terminator sequences. The origin of replication may be provided either by 
construction of the vector to include an exogenous origin, such as may be derived from SV40 
or other viral (e.g., Polyoma, Adeno, VSV, BPV) source, or may be provided by the host 
cell chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter is often sufficient. 

The promoters may be derived from the genome of mammalian cells (e.g., 
metallothionein promoter) or from mammalian viruses (e.g., the adenovirus late promoter; 
the vaccinia virus 7.5K promoter). Further, it is also possible, and may be desirable, to 
utilize promoter or control sequences normally associated with the desired LTBP-encoding 
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gene sequence, provided such control sequences are compatible with the host cell systems. 

A number of viral based expression systems may be utilized, for example, commonly 
used promoters are derived from polyoma, Adenovirus 2, and most frequently Simian Virus 
40 (SV40). The early and late promoters of SV40 virus are particularly useful because both 
are obtained easily from the virus as a fragment which also contains the SV40 viral origin of 
replication. Smaller or larger SV40 fragments may also be used, provided there is included 
the approximately 250 bp sequence extending from the Hind III site toward the Bgll site 
located in the viral origin of replication. 

In cases where an adenovirus is used as an expression vector, the coding sequences 
may be ligated to an adenovirus transcription/ translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric gene may then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region 
of the viral genome (e.g., region El or E3) will result in a recombinant virus that is viable 
and capable of expressing the LTBP polypeptides in infected hosts. 

Specific initiation signals may also be required for efficient translation of LTBP-2 and 
LTBP-3 coding sequences. These signals include the ATG initiation codon and adjacent 
sequences. Exogenous translational control signals, including the ATG initiation codon, may 
additionally need to be provided. One of ordinary skill in the art would readily be capable of 
determining this and providing the necessary signals. It is well known that the initiation 
codon must be in-frame (or in-phase) with the reading frame of the desired coding sequence 
to ensure translation of the entire insert. These exogenous translational control signals and 
initiation codons can be of a variety of origins, both natural and synthetic. The efficiency of 
expression may be enhanced by the inclusion of appropriate transcription enhancer elements, 
transcription terminators (Bittner et al 9 1987). 

For long-term, high-yield production of recombinant LTBP-2 or LTBP-3 proteins, 
stable expression is preferred. For example, cell lines that stably express constructs 
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encoding LTBP-2 or LTBP-3 polypeptides may be engineered. Rather than using expression 
vectors that contain viral origins of replication, host cells can be transformed with vectors 
controlled by appropriate expression control elements {e.g., promoter, enhancer, sequences, 
transcription terminators, polyadenylation sites, etc.), and a selectable marker. Following 
the introduction of foreign DNA, engineered cells may be allowed to grow for 1-2 days in an 
enriched media, and then are switched to a selective media. The selectable marker in the 
recombinant plasmid confers resistance to the selection and allows cells to stably integrate the 
plasmid into their chromosomes and grow to form foci which in turn can be cloned and 
expanded into cell lines. 

A number of selection systems may be used, including, but not limited, to the herpes 
simplex virus thymidine kinase (Wigler et al, 1977), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska et al , 1962) and adenine phosphoribosyltransferase 
genes (Lowy et al, 1980), in tk-, hgprt- or aprt- cells, respectively. Also, antimetabolite 
resistance can be used as the basis of selection for dhfr, that confers resistance to 
methotrexate (Wigler et al, 1980; O'Hare et al, 1981); gpt, that confers resistance to 
mycophenolic acidJMulligan etaL, 1981); neo, that confers resistance to the aminoglycoside 
G-418 (Colberre-Garapin et al, 1981); and hygro, that confers resistance to hygromycin 
(Santerre et al, 1984). 

6. Site-Specific Mutagenesis 

Site-specific mutagenesis is a technique useful in the preparation of individual 
peptides, or biologically functional equivalent proteins or peptides, through specific 
mutagenesis of the underlying DNA. The technique further provides a ready ability to 
prepare and test sequence variants, for example, incorporating one or more of the foregoing 
considerations, by introducing one or more nucleotide sequence changes into the DNA. Site- 
specific mutagenesis allows the production of mutants through the use of specific 
oligonucleotide sequences which encode the DNA sequence of the desired mutation, as well 
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as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size 
and sequence complexity to form a stable duplex on both sides of the deletion junction being 
traversed. Typically, a primer of about 17 to 25 nucleotides in length is preferred, with 
about 5 to 10 residues on both sides of the junction of the sequence being altered. 

In general, the technique of site-specific mutagenesis is well known in the art, as 
exemplified by various publications. As will be appreciated, the technique typically employs 
a phage vector which exists in both a single stranded and double stranded form. Typical 
vectors useful in site-directed mutagenesis include vectors such as the M13 phage. These 
phage are readily commercially available and their use is generally well known to those 
skilled in the art. Double stranded plasmids are also routinely employed in site directed 
mutagenesis which eliminates the step of transferring the gene of interest from a plasmid to a 
phage. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector or melting apart of two strands of a double stranded vector 
which includes within its sequence a DNA sequence which encodes the desired osteotropic 
protein. An oligonucleotide primer bearing the desired mutated sequence is prepared, 
generally synthetically. This primer is then annealed with the single-stranded vector, and 
subjected to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, in 
order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is 
formed wherein one strand encodes the original non-mutated sequence and the second strand 
bears the desired mutation. This heteroduplex vector is then used to transform appropriate 
cells, such as E. coli cells, and clones are selected which include recombinant vectors 
bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected LTBP-2 or LTBP-3 gene using 
site-directed mutagenesis is provided as a means of producing potentially useful species and 
is not meant to be limiting as there are other ways in which sequence variants of osteotropic 
genes may be obtained. For example, recombinant vectors encoding the desired osteotropic 
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gene may be treated with mutagenic agents, such as hydroxy lamine, to obtain sequence 
variants. 

7. In Situ Hybridization 

The following technique describes the detection of mRNA in tissue obtained from the 
site of bone regeneration. This may be refill for detecting expression of T TBP-2 or LTBP- 
3. 

DNA from a plasmid containing the gene for which mRNA is to be detected is 
linearized, extracted, and precipitated with ethanol. Sense and antisense transcripts are 
generated from 1 mg template with T3 and T7 polymerases, e.g., in the presence of [ 35 S] 
UTP at >6 mCi/ml (Amersham Corp., > 1200 Ci/mmol) and 1.6 U/ml RNasin (Promega), 
with the remaining in vitro transcription reagents provided in a kit (SureSite, Novagen Inc.). 
After transcription at 37 °C for 1 hour, DNA templates are removed by a 15 minute digestion 
at 37°C with 0.5 U/ml RNase-free DNase I, extracted, and precipitated with ethanol. 
Riboprobes are hydrolyzed to an average final length of 150 bp by incubating in 40 mM 
NaHC0 3 , 60 mM Na^C^, 80 mM DTT at 60°C, according to previously determined 
formula. Hydrolysis is terminated by addition of sodium acetate, pH 6.0, and glacial acetic 
acid to 0.09 M and 0.005% (v/v), respectively, and the probes are then ethanol precipitated, 
dissolved in 0.1 M DTT, counted, and stored at -20°C until use. 

RNase precautions are taken in all stages of slide preparation. Bouins fixed, paraffin 
embedded tissue sections are heated to 65 °C for 10 minutes, deparaffinized in 3 changes of 
xylene for 5 minutes, and rehydrated in a descending ethanol series, ending in phosphate- 
buffered saline (PBS). Slides will be soaked in 0.2 N HC1 for 5 min., rinsed in PBS, 
digested with 0.0002% proteinase K in PBS for 30 minutes at 37 °C and rinsed briefly with 
DEPC-treated water. After equilibrating for 3 minutes in 0.1 M triethanolamine-HCl (TEA- 
HC1), pH 8.0, sections are acetylated in 0.25% (v/v) acetic anhydride in 0.1 M TEA-HC1 for 
10 minutes at room temperature, rinsed in PBS, and dehydrated in an ascending ethanol 
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series. Each section receives 100-200 ml prehybridization solution (0.5 mg/ml denatured 
RNase-free tRNA (Boehringer-Mannheim), 10 mM DTT, 5 mg/ml denatured, sulfurylated 
salmon sperm DNA, 50% formamide, 10% dextran sulfate, 300 mM NaCl, IX RNase-free 
Denhardt's solution (made with RNase-free bovine serum albumin, Sigma), 10 mM Tris- 
HC1, pH 7.4, 1 mM EDTA) and then incubate on a 50°C slide warmer in a humidified 
enclosure for 2 hours. The sulfurylated salmon-sperm DNA blocking reagent is used in both 
prehybridization md hybridization solutions to help reduce nonspecific binding to tissue by 
35 SH groups on the probe. It is prepared by labeling RNase-free salmon sperm DNA 
(Sigma) with non-radioactive a-thio-dCTP and a-thio-dATP (Amersham) in a standard 
random oligonucleotide-primed DNA labeling reaction. Excess prehybridization solution is 
removed with a brief rinse in 4X SSC before application of probe. 

Riboprobes, fresh tRNA and sulfurylated salmon sperm DNA will be denatured for 10 
minutes at 70°C, and chilled on ice. Hybridization solution, identical to prehybridization 
solution except with denatured probe added to 5 X 10 6 CPM/ml, is applied and slides 
incubated at 50°C overnight in sealed humidified chambers on a slide warmer. Sense and 
antisense probes are applied to serial sections. Slides are rinsed 3 times in 4X SSC, washed 
with 2X SSC, 1 mM DTT for 30 min. at 50°C, digested with RNase A (20 mg/ml RNase A, 
0.5 M NaCl, 10 mM Tris, pH 8.0, 1 mM EDTA, pH 8.0) for 30 min. at 37°C, and rinsed 
briefly with 2X SSC, 1 mM DTT. Three additional washes are performed, each at 50°C for 
30 minutes: once in 2X SSC, 50% formamide, 1 mM DTT, and twice in IX SSC, 0.13% 
(w/v) sodium pyrophosphate, 1 mM DTT. 

Slides are dehydrated in an ascending ethanol series (with supplementation of the 
dilute ethanols (50% and 70%) with SSC and DTT to 0.1X and 1 mM, respectively). Slides 
are exposed to X-ray film for 20-60 hours to visualize overall hybridization patterns, dipped 
in autoradiographic emulsion (Kodak NTB-2, diluted to 50% with 0.3 M ammonium acetate), 
slowly dried for 2 hours, and exposed (4°C) for periods ranging from 8 days to 8 weeks. 
After developing emulsion, sections are counter strained with hematoxylin and eosin, 
dehydrated, and mounted with xylene-based medium. The hybridization signal is visualized 
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under darkfield microscopy. 

The above in situ hybridization protocol may be used, for example, in detecting the 
temporal and spatial pattern of PTH/PTHrP receptor expression. A suitable rat PTH/PTHrP 
receptor cDNA probe (R15B) is one that consists of a 1810 bp region encoding the full 
length rat bone PTH/PTHrP receptor (Abou-Samra et a/., 1992). The cDNA fragment is 
subcloned into p-DNA 1 (Invitrogen Corp., San Diego, CA) and is cut out using Xbal and 
BamHl. This probe has provided positive signals for northern blot analysis of rat, murine, 
and human osteoblastic cell lines, rat primary calvarial cells, and murine bone tissue. The 
pcDNA I plasmid contains a T7 and SP6 promoter that facilitate the generation of cRNA 
probes for in situ hybridization. The full length transcript has been used to detect 
PTH/PTHrP receptor in sections of bone (Lee et al , 1994). The PTHrP cDNA probe 
(Yasuda et al, 1989) is a 400 bp subcloned fragment in pBluescript (Stratagene). This probe 
has been used for in situ hybridization, generating an antisense cRNA probe using BamHl 
cleavage and the T3 primer and a sense cRNA probe using EcoKl cleavage and the 17 
primer. 

8. Monoclonal Antibody Generation 

Means for preparing and characterizing antibodies are well known in the art (See, 
e.g., Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988; incorporated 
herein by reference). 

The methods for generating monoclonal antibodies (MAbs) generally begin along the 
same lines as those for preparing 'polyclonal antibodies. Briefly, a polyclonal antibody is 
prepared by immunizing an animal with an immunogenic composition in accordance with the 
present invention and collecting antisera from that immunized animal. A wide range of 
animal species can be used for the production of antisera. Typically the animal used for 
production of anti-antisera is a rabbit, a mouse, a rat, a hamster, a guinea pig or a goat. 
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Because of the relatively large blood volume of rabbits, a rabbit is a preferred choice for 
production of polyclonal antibodies. 

As is well known in the art, a given composition may vary in its immunogenicity. It 
is often necessary therefore to boost the host immune system, as may be achieved by 
coupling a peptide or polypeptide immunogen to a carrier. Exemplary and preferred carriers 
are keyhole limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other albumins 
such as ovalbumin, mouse serum albumin or rabbit serum albumin can also be used as 
carriers. Means for conjugating a polypeptide to a carrier protein are well known in the art 
and include glutaraldehyde, m-maleimidobencoyl-N-hydroxysuccmimide ester, carbodiimyde 
and bis-biazotized benzidine. 

As is also well known in the art, the immunogenicity of a particular immunogen 
composition can be enhanced by the use of non-specific stimulators of the immune response, 
known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuvant 
(a non-specific stimulator of the immune response containing killed Mycobacterium 
tuberculosis), incomplete Freund's adjuvants and aluminum hydroxide adjuvant. 

The amount of immunogen composition used in the production of polyclonal 
antibodies varies upon the nature of the immunogen as well as the animal used for 
immunization. A variety of routes can be used to administer the immunogen (subcutaneous, 
intramuscular, intradermal, intravenous and intraperitoneal). The production of polyclonal 
antibodies may be monitored by sampling blood of the immunized animal at various points 
following immunization. A second, booster injection, may also be given. The process of 
boosting and titering is repeated until a suitable titer is achieved. When a desired level of 
immunogenicity is obtained, the immunized animal can be bled and the serum isolated and 
stored, and/or the animal can be used to generate MAbs. 

MAbs may be readily prepared through use of well-known techniques, such as those 
exemplified in U.S. Patent 4,196,265, incorporated herein by reference. Typically, this 
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technique involves immunizing a suitable animal with a selected immunogen composition, 
e.g., a purified or partially purified LTBP-2 or LTBP-3 protein, polypeptide or peptide. The 
immunizing composition is administered in a manner effective to stimulate antibody 
producing cells. Rodents such as mice and rats are preferred animals, however, the use of 
rabbit, sheep frog cells is also possible. The use of rats may provide certain advantages 
(Goding, 1986, pp. 60-61), but mice are preferred, with the BALB/c mouse being most 
preferred as this is most routinely used and generally gives a higher percentage of stable 
fusions. 

Following immunization, somatic cells with the potential for producing antibodies, 
specifically B lymphocytes (B cells), are selected for use in the MAb generating protocol. 
These cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a 
peripheral blood sample. Spleen cells and peripheral blood cells are preferred, the former 
because they are a rich source of antibody-producing cells that are in the dividing plasmablast 
stage, and the latter because peripheral blood is easily accessible. Often, a panel of animals 
will have been immunized and the spleen of animal with the highest antibody titer will be 
removed and the spleen lymphocytes obtained by homogenizing the spleen with a syringe. 
Typically, a spleen from an immunized mouse contains approximately 5 X 10 7 to 2 X 10 8 
lymphocytes. 

The antibody-producing B lymphocytes from the immunized animal are then fused 
with cells of an immortal myeloma cell, generally one of the same species as the animal that 
was immunized. Myeloma cell lines suited for use in hybridoma-producing fusion 
procedures preferably are non-antibody-producing, have high fusion efficiency, and enzyme 
deficiencies that render then incapable of growing in certain selective media which support 
the growth of only the desired fused cells (hybridomas). 

Any one of a number of myeloma cells may be used, as are known to those of skill in 
the art (Goding, pp. 65-66, 1986; Campbell, pp. 75-83, 1984). For example, where the 
immunized animal is a mouse, one may use P3-X63/Ag8, X63-Ag8.653, NSl/l.Ag 4 1, 
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Sp210-Agl4, FO, NSO/U, MPC-11, MPC11-X45-GTG 1.7 and S194/5XX0 Bui; for rats, 
one may use R210.RCY3, Y3-Ag 1.2.3, IR983F and 4B210; and U-266, GM1500-GRG2, 
LICR-LON-HMy2 and UC729-6 are all useful in connection with human cell fusions. 

One preferred murine myeloma cell is the NS-1 myeloma cell line (also termed P3- 
NS-l-Ag4-l), which is readily available from the NIGMS Human Genetic Mutant Cell 
Repository by requesting cell line repository number GM3573. Another mouse myeloma cell 
line that may be used is the 8-azaguanine-resistant mouse murine myeloma SP2/0 
non-producer cell line. 

Methods for generating hybrids of antibody-producing spleen or lymph node cells and 
myeloma cells usually comprise mixing somatic cells with myeloma cells in a 2:1 proportion, 
though the proportion may vary from about 20:1 to about 1:1, respectively, in the presence 
of an agent or agents (chemical or electrical) that promote the fusion of cell membranes. 
Fusion methods using Sendai virus have been described by Kohler and Milstein (1975; 
1976), and those using polyethylene glycol (PEG), such as 37% (v/v) PEG, by Gefter et al. 
(1977). The use of electrically induced fusion methods is also appropriate (Goding pp. 
71-74, 1986). 

Fusion procedures usually produce viable hybrids at low frequencies, about 1 x 10* 
to 1 x 10 s . However, this does not pose a problem, as the viable, fused hybrids are 
differentiated from the parental, unfused cells (particularly the unfused myeloma cells that 
would normally continue to divide indefinitely) by culturing in a selective medium. The 
selective medium is generally one that contains an agent that blocks the de novo synthesis of 
nucleotides in the tissue culture media. Exemplary and preferred agents are aminopterin, 
methotrexate, and azaserine. Aminopterin and methotrexate block de novo synthesis of both 
purines and pyrrolidines, whereas azaserine blocks only purine synthesis. Where aminopterin 
or methotrexate is used, the media is supplemented with hypoxanthine and thymidine as a 
source of nucleotides (HAT medium). Where azaserine is used, the media is supplemented 
with hypoxanthine. 
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The preferred selection medium is HAT. Only cells capable of operating nucleotide 
salvage pathways are able to survive in HAT medium. The myeloma cells are defective in 
key enzymes of the salvage pathway, e.g., hypoxanthine phosphoribosyl transferase (HPRT), 
and they cannot survive. The B cells can operate this pathway, but they have a limited life 
span in culture and generally die within about two weeks. Therefore, the only cells that can 
survive in the selective media are those hybrids formed from myeloma and B cells. 

This culturing provides a population of hybridomas from which specific hybridomas 
are selected. Typically, selection of hybridomas is performed by culturing the cells by 
single-clone dilution in microtiter plates, followed by testing the individual clonal 
supernatants (after about two to three weeks) for the desired reactivity. The assay should be 
sensitive, simple and rapid, such as radioimmunoassays, enzyme immunoassays, cytotoxicity 
assays, plaque assays, dot immunobinding assays, and the like. 

The selected hybridomas would then be serially diluted and cloned into individual 
antibody-producing cell lines, which clones can then be propagated indefinitely to provide 
MAbs. The cell lines may be exploited for MAb production in two basic ways. A sample 
of the hybridoma can be injected (often into the peritoneal cavity) into a histocompatible 
animal of the type that was used to provide the somatic and myeloma cells for the original 
fusion. The injected animal develops tumors secreting the specific monoclonal antibody 
produced by the fused cell hybrid. The body fluids of the animal, such as serum or ascites 
fluid, can then be tapped to provide MAbs in high concentration. The individual cell lines 
could also be cultured in vitro, where the MAbs are naturally secreted into the culture 
medium from which they can be readily obtained in high concentrations. MAbs produced by 
either means may be further purified, if desired, using filtration, centrifugation and various 
chromatographic methods such as HPLC or affinity chromatography. 

9. LTBP Structure and Function 



The LTBPs appear to be modular polypeptides characterized by the presence of 
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multiple cysteine-rich motifs. Molecular cloning of human LTBP-1 (Kanzaki et al , 1990) 
indicates, for example, that the molecule consists of 16 epidermal growth factor-like repeats 
with the potential to bind calcium (EGF-CB repeats), 3 copies of a unique motif containing 8 
cysteine residues, an RGD motif, and an 8 amino acid motif identical to the cell binding 
domain of the laminin 02 chain. EGF-CB repeats may be modified to contain 
hydroxyaspartic acid and hydroxyasparagine (Stenflo et al. , 1987). The genes that code for 
two LTBPs have been isolated previously. In sharp contrast to those of the prior art, the 
LTBP-3 sequence of the present invention shares only 40% sequence identity with those of 
the LTBP-1 sequence (Kanzaki et al, 1990) and the LTBP-2 sequence (Moren et al, 1994. 

Unlike the human LTBP genes isolated previously which are localized to human 
chromosome 2, band pl2-q22 (LTBP-1, Stenman et al, 1994) and human chromosome 14, 
band q24 (LTBP-2, Moren et al, 1994), the LTBP-3 of the present invention is localized to 
chromosome 11, band ql2. 

One aspect of the present invention is the mapping of the murine LTBP-3 to mouse 
chromosome 19, band B, a region of conserved synteny with human chromosome 11, band 
ql2. 

While the function of LTBP is unknown (i.e., studies with transfected CHO cells 
indicate that LTBP does not contribute to TGF-0 latency), several ideas have been proposed 
that, when taken together, suggest that LTBP may function as a extracellular structural 
protein capable of both regulating and targeting TGF-0 activity. First, LTBP may regulate 
the intracellular biosynthesis of latent TGF-/J. Cultured erythroleukemia cells efficiently 
assemble and secrete large latent TGF-0 complexes, but they poorly secrete latent TGF-/3 
complexes that lack LTBP-1 (Miyazono et al, 1991; 1992). The retained complexes contain 
anomalous disulfide bonds, suggesting that, for erythroleukemia cells at least, LTBP 
contributes to the normal assembly and secretion of TGF-0 latent complexes. Second, LTBP 
may function to target latent TGF-/3 to specific types of connective tissue. Recent evidence 
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suggests that the large latent TGF-j3 complex covalently binds the extracellular matrix via 
LTBP-1, i.e., LTBP-1 may target the latent TGF-/3 complex to a site near the cell surface 
(pericellular matrix) to facilitate the generation of autocrine or paracrine effects (Flaumenhaft 
et al, 1993; Taipale et al, 1994; Taipale and Keski-Oja, 1992). Third, LTBP may 
modulate the activation of latent complexes. There is direct evidence that LTBP-1 binds 
calcium (Colosetti et al, 1993) and that calcium binding induces a structural change that 
protects LTBP-1 from proteolytic attack. As described above (Lyons et al. , 1988; Taipale et 
al, 1995), latent TGF-0 is exposed to protease-rich environments during wound repair and 
normal development and exposure to these environments leads to the release of mature TGF- 
/3 from extracellular storage sites. It is therefore possible that the protease-resistant 
conformation of LTBP helps ensure TGF-/J integrity in vivo. 

10. LTBP and Skeletal Tissues 

Skeletal tissue represents one of the largest known repositories of latent TGF-/3 (200 
figfkg bone; Seyedin et al., 1986; Seyedin et al, 1987). Moreover, activated TGF-0 may 
stimulate boneformation in developing tissues and may act as a "coupling factor" that 
coordinates matrix resorption and formation during bone remodeling (Centrella et al, 1991). 
Finally, activated TGF-/3 may exert a powerful osteoinductive stimulus following fracture 
(Joyce et al. , 1990; Beck et al. , 1993). 

It has not yet been determined if TGF-/J normally exists in bone as a small or large 
latent complex. Previous studies (Pfeilschifter et al, 1990; Bonewald et al, 1991) have 
shown that the major form of TGF-0 in conditioned media from bone organ cultures is a 100 
kDa latent complex that lacks LTBP-1. Additionally, Dallas et al. (Dallas et al. , 1994) have 
demonstrated that cultured MG63, ROS 17/2.8, and UMR-106 cells (derived from 
osteosarcomas of various types) each secrete two major forms of latent TGF-/31, namely, a 
290 kDa complex that contains LTBP-1 and a 100 kDa complex lacking LTBP-1. A second 
high molecular weight complex that contained latent TGF-/32 and LTBP was also identified. 
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The presence and relative amount of low and high molecular weight complexes varied with 
cell type, and TGF-jSl and LTBP did not appear to be co-expressed in bone cells. These 
results led the authors to conclude that the 100 kDa latent TGF-0 complex is a 
physiologically important form in bone cells, i.e., LTBP did not appear to be required for 
the proper and efficient assembly and secretion of small latent complexes. 

At the end of their paper, Dallas et al. also acknowledged that the production of small 
versus large latent TGF-/3 complexes may be associated with specific stages in the maturation 
of bone cells — e.g., MG63, ROS 17/2.8, and UMR-106 cells are known to express 
different subsets of mature osteoblast phenotypic markers, which could explain differences in 
the size of the latent TGF-/3 complexes produced by the various osteosarcoma cell lines. 
Along this line, the inventors' laboratory has recently shown that mouse pre-osteoblast 
MC3T3-E1 cells express the ltbp-3 gene at the outset of osteoblast differentiation and that the 
LTBP-3 polypeptide binds TGF-/31 in MC3T3-E1 conditioned media (Yin et al., 1995). 
Therefore, MC3T3-E1 cells express large latent TGF-01 complexes bearing LTBP-3 
precisely at the time of the pre-osteoblast to osteoblast transition (i.e. , at -day 14 in culture, 
or at the onset^of alkaline phos phata se expression). 

Extending the MC3T3-E1 data further, the inventors' laboratory has also found that 
ltbp-2 and ltbp-3 are co-expressed with TGF-|8 in developing mouse skeletal tissues (Yin et 
al. , 1995) and that ltbp-3 (at least) is expressed at sites of osteotomy repair in vivo (Yin et 
al, 1994). 

There is little doubt that TGF-/3 contributes to the normal structure and function of 
skeletal tissues. Bone is an abundant source of latent TGF-/3, and mature TGF-j8 contributes 
to the processes of skeletal morphogenesis, bone remodeling, and bone repair. 
Consequently, bone must regulate the autocrine and paracrine effects of TGF-/3 with 
precision. The inventors' laboratory was the first to clone and map the mouse ltbp-2 and 
ltbp-3 genes, and it has obtained evidence that both genes are expressed during normal 
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murine skeletal morphogenesis and during bone osteotomy repair. In addition, in MC3T3-E1 
pre-osteoblasts TGF-/J is synthesized as a homodimer known as the small latent complex that 
covalently binds LTBP to form large latent complexes. These results suggest for the first 
time that LTBP facilitates the assembly and secretion of latent TGF-jS complexes and targets 
latent TGF-0 to bone matrix. With the availability of the ltbp-2 and ltbp-3 genes, the 
opportunity exists in the inventors' laboratory to gain further insight into LTBP structure and 
function and, in turn, the mechanism by which latent TGF-jS complexes can be targeted to 
bone matrix and cells in a controlled manner. 

11. Detection of LTBP-encoding DNA Segments 

The amount of an LTBP-2 or LTBP-3-encoding DNA segment present within a 
biological sample, such as blood, serum or PBMC sample, may be determined by means of a 
molecular biological assay to determine the level of a nucleic acid that encodes such an 
LTBP-2 or LTBP-3 polypeptide, or by means of an immunoassay to determine the level of 
the polypeptide itself. 

In a molecular biological method for detecting a cell that produces LTBP-2 or LTBP- 
3, one would obtain nucleic acids from one or more cells and analyze the nucleic acids to 
identify a nucleic acid segment that encodes LTBP-2 or LTBP-3. Such nucleic acids may be 
identified by length, where an appropriate assay would be a PCR™-based assay resulting in 
the identification of an LTBP-2 or LTBP-3-encoding mRNA transcript. Alternatively, the 
nucleic acid segment may be identified by sequence, which method generally includes 
identifying a transcript with a sequence of the present invention e.g., by Northern or 
Southern blotting using a discriminating probe prepared in accordance with SEQ ID NO:l or 
SEQ ID NO:3. 

The detection of a cell that produces LTBP-2 or LTBP-3-encoding DNA segment 
using a method based upon the sequence of an ltbp-2 or ltbp-3 transcript requires an ltbp-2 or 
ltbp-3 probe with a novel DNA sequence as disclosed herein. This imparts an evident utility 
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to the nucleic acid segments of the present invention, particularly the shorter ones. 

The presence of a substantially complementary nucleic acid sequence in a sample, or 
a significantly increased level of such a sequence in comparison to the levels in a normal or 
"control" sample, will thus be indicative of a sample that contains a cell that harbors an 
LTBP-2 or LTBP-3-encoding DNA segment. Here, substantially complementary LTBP-2 or 
LTBP-3-encoding nucleic acid sequences are those that have relatively little sequence 
divergence and that are capable of hybridizing under relatively stringent conditions, as 
discussed above. 

A variety of hybridization techniques and systems are known that can be used in 
connection with the detection aspects of the invention, including diagnostic assays such as 
those described in Falkow et al , U.S. Patent 4,358,535. Short coding or non-coding nucleic 
acid segment probes may also be employed as primers in connection with diagnostic PCR™ 
technology, as well as for use in any of a number of other PCR™ applications, including 
PCR™-based cloning and engineering protocols. 

In general, the "detection" of an LTBP-2 or LTBP-3-encoding DNA segment is 
accomplished by attaching or incorporating a detectable label into the nucleic acid segment 
used as a probe and "contacting" a sample with the labeled probe. In such processes, an 
effective amount of a nucleic acid segment that comprises a detectable label (a probe), is 
brought into direct juxtaposition with a composition containing target nucleic acids. 
Hybridized nucleic acid complexes may then be identified by detecting the presence of the 
label, for example, by detecting a radio, enzymatic, fluorescent, or even chemiluminescent 
label. 

Many suitable variations of hybridization technology are available for use in the 
detection of nucleic acids, as will be known to those of skill in the art. These include, for 
example, in situ hybridization, Southern blotting and Northern blotting. In situ hybridization 
describes the techniques wherein the target nucleic acids contacted with the probe sequences 
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are those located within one or more cells, such as cells within a clinical sample or even 
cells grown in tissue culture. As is well known in the art, the cells are prepared for 
hybridization by fixation, e.g., chemical fixation, and placed in conditions that allow for the 
hybridization of a detectable probe with nucleic acids located within the fixed cell. 

Alternatively, target nucleic acids may be separated from a cell or sample prior to 
contact with a probe. Any of the wide variety of methods for isolating target nucleic acids 
may be employed, such as cesium chloride gradient centrifugation, chromatography {e.g., 
ion, affinity, magnetic), phenol extraction and the like. Most often, the isolated nucleic acids 
will be separated, e.g., by size, using electrophoretic separation, followed by immobilization 
onto a solid matrix, prior to contact with the labelled probe. These prior separation 
techniques are frequently employed in the art and are generally encompassed by the terms 
"Southern blotting" and "Northern blotting". Although the execution of various techniques 
using labeled probes to detect LTBP-2 or LTBP-3-encoding DNA or RNA sequences in 
clinical samples are well known to those of skill in the art, a particularly preferred method is 
described in detail herein, in Examples 1 and 3. 

Kits for use in Southern and Northern blotting to identify LTBP-2 or LTBP-3- 
encoding DNA segments are also contemplated to fall within the scope of the present 
invention. Such kits will generally comprise, in suitable container means, ltbp-2 or ltbp-3 
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nucleic acid probes; unrelated probes for use as controls; and optionally, one or more 
restriction enzymes. 

12. Construction of Chimeric Promoter-Reporter Expression Plasmids 

Using the 5' upstream flanking sequence of the LTBP-2 and LTBP-3 genes, a series 
of restriction fragments from available phage genomic inserts may be generated and 
subcloned into a promoter-reporter expression plasmid. An example of such a vector is 
pGL3, a luciferase reporter vector (Promega), which has a strong 5' terminator of 
transcription, a multiple cloning site, the cDNA coding sequence of insect luciferase, a 
strong translation stop codon, and an intron/polyadenylation signal sequence derived from 
SV40. 

By assembling 9-15 overlapping promoter-reporter expression plasmids that cover an 
about 5 kb region of interest, the identification of potential as-acting elements is 
contemplated. Once the initial constructs have been characterized, regions of 5' upstream 
flanking sequence Jhat show strong promotion (or repression) of gene expression may be 
studied in careful detail. 

The Northern analysis and tissue in situ hybridization data presented herein suggest 
that ltbp-3 is highly expressed in developing mouse tissues. Additionally, the ltbp-3 
transcript appears to be highly expressed by MC3T3-E1 cells in culture. The structural 
features of this sequence are consistent with a so-called housekeeping gene, i.e., a TATA- 
less and CAATT-less promoter sequence that is 70% GC-rich. 

To overcome potential problems of low reporter gene expression, the inventors 
contemplate the use of pEU-CAT, a promoter-less CAT vector especially constructed for the 
analysis of weak promoters (Harduin-Lepers et al, 1993). Alternatively, modifying the 
reporter constructs by the addition of an ltbp-3 enhancer element is also contemplated to be 
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useful. 



13. Evaluating Promoter Function 

Promoter function may be evaluated by in vitro transfection studies using L cells or 
NIH3T3 cells, since they have been used successfully for this purpose. An interesting 
alternative is to use Drosophila SL2 cells which: lack the trans-acting factor Spl; co- 
transfection of SL2 cells with a plasmid encoding the Spl protein such as pP^Spl and a 
reporter construct containing putative Spl binding sites will result in reporter expression if 
the sites are functional (Courey and Tjian, 1988). L cell transfection is performed using 
standard protocols (Sambrook et ai, 1989). Briefly, subconfluent cells (covering -20% of 
a 100 mm plastic tissue culture dish) are washed 2x in DMEM tissue culture medium 
(GIBCO) and then incubated for 3 hrs. at 37°C in a sterile mixture of DEAE-dextran (0.25 
mg/ml), chloroquine (55 mg/ml), and 15 jtg plasmid DNA (Courey and Tjian, 1988). Cells 
are then shocked by incubation with 10% DMSO in sterile PBS for 2 min. at 37°C, washed 
2x with DMEM (Sambrook et al, 1989), and incubated in DMEM plus 10% fetal calf serum 
and antibiotics for72 nr. at 37°C^ Since DEAE-dextran is quite toxic to NIH3T3 cells, 
these cells may be transfected by electroporation.] 

Using the pGL3 reporter/expression vector, luciferase activity may be measured in 
cell lysates using Luciferase Assay System kit reagents (Promega) according to protocols 
provided by the manufacturer. To measure background CPM, 20 fil of cell lysate is added to 
a clean microcentrifuge tube and light activity is measured in a scintillation counter (System 
1400 scintillation counter, Wallac Nuclear; all channels open). The same procedure is used 
to measure background CPM of 100 fil luciferase substrate stock solution. Once the 
background CPM have been documented, homogenate and substrate are mixed and light 
emission is measured immediately. Enzyme activity values are routinely obtained in 
triplicate, normalized to 1 mg of total protein, and expressed as a mean value +_ the standard 
deviation. Student's t test is used to determine statistical significance of differences among 
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groups (95% confidence level). 

In one study, following transfection of COS cells with the pGL3 luciferase expression 
plasmid, an aliquot of a cell homogenate was assayed for enzyme activity using commercially 
available kits and according to the manufacturer's recommendations. An equal aliquot of a 
cell homogenate prepared from untransfected COS served as a negative control. Significant 
luciferase activity was found only in the homogenat ■ ; from transfected cells. 

14. Biological Functional Equivalents 

As mentioned above, modification and changes may be made in the structure of an 
osteotropic gene and still obtain a functional molecule that encodes a protein or polypeptide 
with desirable characteristics. The following is a discussion based upon changing the amino 
acids of a protein to create an equivalent, or even an improved, second-generation molecule. 
The amino acid changes may be achieved by changing the codons of the DNA sequence, 
according to the following codon table: 
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Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


UUU 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


lie 


I 


AUA 


AUC 


AUU 








Lys ine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


ccc 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine _ 


Va l _ 


_ V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











For example, certain amino acids may be substituted for other amino acids in a 
protein structure without appreciable loss of interactive binding capacity with structures such 
as, for example, antigen-binding regions of antibodies or binding sites on substrate 
molecules. Since it is the interactive capacity and nature of a protein that defines that 
protein's biological functional activity, certain amino acid sequence substitutions can be made 
in a protein sequence, and, of course, its underlying DNA coding sequence, and nevertheless 
obtain a protein with like properties. It is thus contemplated by the inventors that various 
changes may be made in the DNA sequences of osteotropic genes without appreciable loss of 
their biological utility or activity. 
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In making such changes, the hydropathic index of amino acids may be considered. 
The importance of the hydropathic amino acid index in conferring interactive biologic 
function on a protein is generally understood in the art (Kyte and Doolittle, 1982, incorporate 
herein by reference). It is accepted that the relative hydropathic character of the amino acid 
contributes to the secondary structure of the resultant protein, which in turn defines the 
interaction of the protein with other molecules, for example, enzymes, substrates, receptors, 
DNA, artibodies, antigens, -md the like. 

Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982), these are: Isoleucine 
(+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); 
methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan 
(-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); 
aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar 
biological activity, i.e. , still obtain a biological functionally equivalent protein. In making 
such changes, the substitution of amino acids whose hydropathic indices are within ±2 is 
preferred, those which are within +1 are particularly preferred, and those within +0.5 are 
even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids can be made 
effectively on the basis of hydrophilicity. U.S. Patent 4,554,101, incorporated herein by 
reference, states that the greatest local average hydrophilicity of a protein, as governed by 
the hydrophilicity of its adjacent amino acids, correlates with a biological property of the 
protein. 

As detailed in U.S. Patent 4,554,101, the following hydrophilicity values have been 
assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 + 1); 
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glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); 
threonine (-0.4); proline (-0.5 + 1); alanine (-0.5); histidine *-0.5); cysteine (-1.0); 
methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); 
phenylalanine (-2.5); tryptophan (-3.4). 

It is understood that an amino acid can be substituted for another having a similar 
hydrophilicity viue and still obtain a biologically equivalent, and in particular, an 
immunologically equivalent protein. In such changes, the substitution of amino acids whose 
hydrophilicity values are within +2 is preferred, those which are within + 1 are particularly 
preferred, and those within +0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on the 
relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions which 
take various of the foregoing characteristics into consideration are well known to those of 
skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed 
in the examples which follow represent techniques discovered by the inventors to function 
well in the practice of the invention, and thus can be considered to constitute preferred 
modes for its practice. However, those of skill in the art should, in light of the present 
disclosure, appreciate that many changes can be made in the specific embodiments which are 
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disclosed and still obtain a like or similar result without departing from the spirit and scope 
of the invention. 

EXAMPLE I 

ISOLATION OF A NOVEL LATENT TGF-0 
BINDING PROTEIN-LIKE LTBP-3 GENE 

The TGF-/3s represent a family of structurally related molecules with diverse effects 
on mammalian cell shape, growth, and differentiation (Roberts and Sporn, 1990). Initially 
synthesized as a precursor consisting of an amino-terminal propeptide followed by mature 
TGF-/3, two chains of nascent pro-TGF-jS associate in most tissues to form a Mr ~ 106,000 
inactive disulfide-bonded dimer. Homodimers are most common, but heterodimers have also 
been described (Cheifetz et al, 1987; Ogawa et al, 1992). During biosynthesis the mature 
TGF-/3 dimer is cleaved from the propeptide dimer. TGF-/J latency results in part from the 
non-covalent association of propeptide and mature TGF-/3 dimers (Pircher et al, 1984 and 
1986; Wakefield et al, 1987; Millan et al., 1992; see also Miyazono and Heldin, 1989). 
Consequently, the propeptide dimer is often referred to as the latency associated protein 
(LAP), and LAP plus the disulfI3e"-bonded TGF-/3 dimer are also known as the small latent 
complex. In the extracellular space small latent complexes must be dissociated to activate 
mature TGF-/5. The mechanism of activation of the latent complex is thought to be one of 
the most important steps governing TGF-/? effects (Lyons et al., 1988; Antonelli-Orlidge et 
al., 1989; Twardzik et al. , 1990; Sato et al, 1993). 

In certain lines of cultured cells small latent growth factor complexes may contain 
additional high molecular weight proteins. The best characterized of these high molecular 
weight proteins is the latent TGF-/5 binding protein, or LTBP (Miyazono et al. , 1988; 
Kanzaki et al, 1990; Tsuji et al, 1990; Olofsson et al, 1992; Taketazu et al, 1994). 
LTBP produced by different cell types is heterogeneous in size, perhaps because of 
alternative splicing or because of tissue-specific proteolytic processing (Miyazono et al, 
1988; Wakefield et al, 1988; Kanzaki et al, 1990; Tsuji et al, 1990). Latent TGF-/3 
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complexes that contain LTBP are known as large latent complexes. LTBP has no known 
covalent linkage to mature TGF-j3, but rather it is linked by a disulfide bond to LAP. 

Two LTBPs have been isolated to date. The deduced human LTBP-1 amino acid 
sequence is comprised of a signal peptide, 16 epidermal growth factor-like repeats with the 
potential to bind calcium (EGF-CB repeats), 2 copies of a unique motif containing 8 cysteine 
residues, an RGD cell attachment motif, and an 8 amino acid motif identical to the cell 
binding domain of the laminin B2 chain (Kanzaki et al, 1990). There is evidence that 
LTBP-1 binds calcium, which, in turn, induces a structural change that protects LTBP from 
proteolytic attack (Colosetti et al, 1993). LTBP-2 shows 41% sequence identity to LTBP-1, 
and its structural domains show a similar overall organization (Moren et al, 1994). 

While the functions of LTBP-1 and LTBP-2 presently are unknown, several ideas 
have been put forward in the literature. First, LTBP may regulate the intracellular 
biosynthesis of latent TGF-jS precursors. Cultured erythroleukemia cells efficiently assemble 
and secrete large latent TGF-jS complexes, whereas they slowly secrete small latent TGF-j8 
complexes that contain anomalous disulfide bonds (Miyazono et al, 1991; 1992). Therefore, 
LTBP may facilitate the normal assembly and secretion of latent TGF-/3 complexes. Second, 
LTBP may target latent TGF-j3 to specific types of connective tissue. Recent evidence 
suggests that the large latent TGF-j3 complex is covalently bound to the extracellular matrix 
via LTBP (Taipale et al, 1994). Based on these observations, LTBP has been referred to as 
a "matrix receptor", Le. a secreted protein that targets and stores latent growth factors such 
as TGF-/J to the extracellular matrix. Third, LTBP may modulate the activation of latent 
complexes. This idea is based in part on recent evidence which suggests that mature TGF-/J 
is released from extracellular storage sites by proteases such as plasmin and thrombin and 
that LTBP may protect small latent complexes from proteolytic attack (Falcone et al f 1993; 
Benezra et al, 1993; Taipale et al, 1994), i.e. protease activity may govern the effect of 
TGF-/3 in tissues, but LTBP may modulate this activity. Fourth, LTBP may plays an 
important role in targeting the latent TGF-j8 complex to the cell surface, allowing latent 
TGF-j8 to be efficiently activated (Flaumenhaft et al, 1993). 
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A. MATERIALS AND METHODS 



1. cDNA Cloning 

Aliquots (typically 40-50,000 PFU) of phage particles from a cDNA library in the 
XZAPII® vector made from NIH 3T3 cell mRNA (Stratagene) and fresh overnight 
XLl-Blue™ cells (grown in Luria broth supplemented with 0.4% maltose in 10 mM MgS0 4 ) 
were mixed, incubated for 15 min. at 37°C, mixed again with 9 ml of liquid (50°C) top 
layer agarose (NZY broth plus 0.75% agarose), and then spread evenly onto freshly poured 
150 mm NZY-agar plates. Standard methods were used for the preparation of plaque-lifts and 
filter hybridization (42°C, in buffer containing 50% formamide, 5X SSPE, IX Denhardt's, 
0.1% SDS, 100 mg/ml salmon sperm DNA, 100 mg/ml heparin). Filters were washed 
progressively to high stringency (0.1X SSC/0.1% SDS, 65°C). cDNA probes were 
radiolabeled by the nick translation method using commercially available reagents and 
protocols (Nick Translation Kit, Boehringer Mannheim). Purified phage clones were 
converted to pBluescript® plasmid clones, which were sequenced using Sequenase (v2.0) as 
described (Chen et al, 1993; Yin et al, 1995). Sequence alignment and identity was 
determined using sequence analysis programs from the Genetics Computer Group 
(Mac Vector). 

2. Tissue In Situ Hybridization 

To prepare normal sense and antisense probes, a unique 342 bp fragment from the 3' 
untranslated region (+3973 to +4314, counting the "A" of the initiator Met codon as +1; 
see "ish% FIG. 1) was subcloned into the pBSKS+ plasmid (Stratagene, Inc.). Template 
DNA was linearized with either EcoRL or BamHI, extracted, and precipitated with ethanol. 
Sense and antisense transcripts were generated from 1 mg template with T3 and T7 
polymerases in the presence of [ 35 S]UTP at >6 mCi/ml (Amersham, > 1200 Ci/mmol) and 
1.6 U/ml RNasin (Promega), with the remaining in vitro transcription reagents provided in a 
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kit (SureSite, Novagen Inc.). After transcription at 37°C for 1 h, DNA templates were 
removed by a 15 min. digest at 37°C with 0.5 U/ml RNase-free DNase I, extracted, and 
precipitated with ethanol. Riboprobes were hydrolyzed to an average final length of 150 bp 
by incubating in 40 mM NaHC0 3 , 60 mM Na 2 C0 3 , 80 mM DTT for -40 min. at 60°C. 
Hydrolysis was terminated by addition of sodium acetate, pH 6.0, and glacial acetic acid to 
.09 M and 0.56% (v/v), respectively, and the probes were then ethanol precipitated, 
dissolved in 0.1 M DTT, counted, and stored at -20°C until use. Day 8.5-9.0, day 13.5, 
and day 16.5 mouse embryo tissue sections (Novagen) and the in situ hybridization protocol 
were exactly as described (Chen et al , 1993; Yin et aL, 1995). 

3, Northern Analysis 

MC3T3-E1 cell poly(A+) RNA (2-10 mg aliquots) was electrophoresed on a 1.25% 
agarose/2.2 M formaldehyde gel and then transferred to a nylon membrane (Hybond-N, 
Amersham). The RNA was cross-linked to the membrane by exposure to a UV light source 
(1.2 x 10 6 mJ/cm 2 , UV Stratalinker 2400, Stratagene) and then pre-hybridized for > 15 min. 
at 65°C in Rapid-Hyb buffer (Amersham, Inc.). A specific cDNA probe consisting solely of 
untranslated sequence from the 3' end of the transcript was 32 P-labeled by random priming 
and used for hybridization (2 h at 65 °C). Blots were washed progressively to high 
stringency (0.1X SSC/0.1% SDS, 65 °C), and then placed against x-ray film with intensifying 
screens (XAR, Kodak) at -86°C. 

4. Antibody Preparation 

LTBP-3 antibodies were raised against a unique peptide sequence found in domain #2 
(amino acids 155-167). Peptide #274 (GESVASKHAIYAVC) (SEQ ID NO:5) was 
synthesized using an ABI model 431A synthesizer employing FastMoc chemistry. The 
sequence was confirmed using an ABI473 protein sequencer. A cysteine residue was added 
to the carboxy-terminus to facilitate crosslinking to carrier proteins. For antibody 
production, the synthetic peptide was coupled to rabbit serum albumin (RSA) using MBS 
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(m-maleimidobenzoic acid-N-hydroxysuccinimide ester) at a substitution of 7.5 mg peptide 
per mg of RSA. One mg of the peptide-RSA conjugate in 1 ml of Freund's complete 
adjuvant was injected subcutaneously at 10 different sites along the backs of rabbits. 
Beginning at 3 weeks after initial immunization, the rabbits were given bi-weekly booster 

5 injections of 1 mg peptide-RSA in 100 ul of Freund's incomplete adjuvant. IgG was 

prepared by mixing immune serum with caprylic acid (0.7 ml caprylic acid per ml serum), 
stirring for 30 min., and centrifuging at 5,000 X j for 10 min. The supernatant was 
decanted and dialyzed against two changes of phosphate buffered saline (PBS) overnight at 
4°C. The antibody solution was then affinity purified by passing it over a column containing 

10 the immunizing peptide coupled to Affi-gel 10 affinity support. Bound antibodies were 

eluted with 0.2 M glycine (pH 2.3), immediately dialyzed against PBS, and concentrated to 1 
mg/ml. prior to storage at -70°C. 



ill 5. Transaction 

jil5 

% Transient transfection was performed using standard protocols (Sambrook et al , 

tfl 1989). Briefly, subconfluent ce lls ( covering -20% of a 100 mm plastic tissue culture dish) 

□ were washed 2x in DMEM tissue culture medium (GIBCO) and then incubated for 3 hrs. at 

37°C in a sterile mixture of DEAE-dextran (0.25 mg/ml), chloroquine (55 mg/ml), and 15 
R20 mg plasmid DNA (Courey and Tjian, 1988). Cells then were shocked by incubation with 
S 10% DMSO in sterile PBS for 2 min. at 37°C, washed 2x with DMEM (Sambrook et al, 

1989), and incubated in DMEM plus 10% fetal calf serum and antibiotics for 72 hr. at 37°C 

6. Immunoprecipitation 

25 

For immunoprecipitation, 1 ml of antibody (1:400 final concentration, in PBS-TDS 
buffer: 0.38 mM NaCl, 2.7 mM KC1, 8.1 mM Na 2 HP0 4 , 1.5 mM KH 2 P0 4 , 1% Triton 
X-100, 0.5% Deoxycholic acid, and 0.1% SDS) was added to 1 ml of radiolabeled medium 
proteins. The mixture was incubated with shaking at 4°C for 1 hr., protein A-sepharose 
30 CL-4B beads were added (200 ml, 10% suspension), and this mixture was incubated with 
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shaking for one additional hour at 4°C. Immunoprecipitated proteins were pelleted by brief 
centrifiigation, the pellet was washed 6x with PBS-TF^S buffer, 2x protein loading dye was 
added, and the samples were boiled for 5 min. and then fractionated on 4-18% gradient 
SDS-PAGE (Bonadio et ai, 1985). Cold molecular weight markers (200 kDa-14.3 kDa, 
Rainbow mix, Amersham) were used to estimate molecular weight. The gel was dried and 
exposed to film for the indicated time at room temperature. 

7. Western Analysis 

Fractionated proteins within SDS-polyacrylamide gels were transferred to a 
nitrocellulose filter for 2 hours using Tris-glycine-methanol buffer, pH 8.3 at 0.5 mA/cm 2 . 
The filter was blocked, incubated with nonfat milk plus antibody (1:1000 dilution) for 2 hr, 
and washed. Antibody staining was visualized using the ECL Western blotting reagent 
(Amersham) according to the manufacturer's protocols. 

B. RESULTS 

In this study, the inventors isolated and characterized a novel murine fibrillin-like 
cDNA encoding LTBP-3. To clone the murine LTBP-3 gene, cDNA from a 3T3 cell cDNA 
library was amplified using human fibrillin- 1 PCR™ primers under low stringency conditions 
(i.e., annealing at 37°C initially for 10 cycles, followed by annealing at 60°C for 30 cycles). 
The results indicated that a murine DNA fragment of unexpectedly low homology (-50%) 
to human fibrillin- 1 was obtained. Molecular cloning of the authentic murine fibrillin- 1 
transcript was also performed, confirming the human and murine fibrillin-1 coding sequences 
share >95% sequence identity. The murine fibrillin- 1 and PCR™ sequences were different, 
which suggested that the PCR™ product may have been derived from a related, fibrillin-like 
cDNA. The 3T3 cell cDNA library was screened at high stringency using the murine PCR™ 
product as the probe in order to test this hypothesis. A cDNA walking strategy eventually 
yielded seven overlapping cDNA clones (FIG. 1). It provides a unique mRNA of 4,314 
nucleotides, with an open reading frame of 3,753 nucleotides (SEQ ID NO:3) ,FIG. 9). The 
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deduced molecule is a unique polypeptide of 1,251 amino acids (SEQ ID NO:4) (FlG. 10), 
Excluding the signal peptide (21 amino acids), the novel fibrillin-like molecule consists of 
five structurally distinct regions (Region 1- Region 5), and although similar to murine 
fibrillin- 1 (FIG. 2A), its domain structure is unique as is evidenced by the schematic 
representation of LTBP-3 shown in FIG. 2B. 

Domain #1 is a 28 airino acid seg nent with a net basic charge (est pi, 12.36) that 
may allow for binding acidic molecules in the extracellular matrix (e.g., acidic 
proteoglycans). Sequences rich in basic amino acids may also function as endoproteolytic 
processing signals (Barr, 1991; Steiner et aL, 1992), which suggests that the NH 2 -terminus 
may be proteolytically processed. Domain #2 extends for 390 amino acids, consisting of an 
EGF-like repeat, a 135 amino acid segment that was proline-rich (20.7%) and glycine-rich 
(11.8%) but not cysteine-rich, a Fibmotif (Pereira et aL, 1993), an EGF-CB repeat, and a 
TGF-bp repeat. Domain #3 is a 113 amino acid segment characterized by its high proline 
content (21%). Domain #4 extends for 678 amino acids and consists of 14 consecutive 
cysteine-rich repeats. Based on structural homologies, 12/14 repeats were epidermal growth 
factor-calcium binding (EGF-^B) motifs (Handford et aL, 1991), whereas 2/14 were 
transforming growth factor-/3-binding protein (TGF-bp) motifs (Kanzaki et aL, 1990). 
Finally, domain #5 is a 22 amino acid segment at the carboxy-terminus. The conceptual 
amino acid sequence encoded by the open reading frame consisted of 1,251 amino acids 
(FIG. 2B) with an estimated pi of 5.92, a predicted molecular mass of 134,710 Da, and five 
potential TV-linked glycosylation sites. No RGD sequence was present. 

Northern blot analysis of murine embryo RNA using a 3' untranslated region probe 
identified a transcript band of -4.6 kb. In this regard, 4,310 nt have been isolated by 
cDNA cloning, including a 3' untranslated region of 401 nt and a 5' upstream sequence of 
156 nt. The apparent discrepancy between the Northern analysis result and the cDNA 
sequence analysis suggested that the 5' upstream sequence may include -300 nt of additional 
upstream sequence. This estimate was consistent with preliminary primer extension mapping 
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studies indicating that the 5' upstream sequence is 400-500 nt in length. 

A total of 19 cysteine-rich repeats were found in domains #2 and #4 of the murine 
LTBP-like (LTBP-3) polypeptide. Thirteen were EGF-like and 11/13 contained the calcium 
binding consensus sequence. This consensus was derived from an analysis of 154 EGF-CB 
repeats in 23 different proteins and from structural analyses of the EGF-CB repeat, both 
bound and unbound to calcium ion (Selander-Sunneihagen et al , 1992). Variations on the 
consensus have been noted previously and one of these, D-L-N/D-E-Q, was identified in the 
third EGF-like repeat of domain #4. In addition, a potential calcium binding sequence which 
has not previously been reported (E-T-N/D-E-Q) was identified in the first EGF-like repeat 
of domain #4. Ten of thirteen EGF-CB repeats also contained a second consensus sequence 
which represents a recognition sequence for an Asp/Asn hydroxylase that co- and 
post-translationally modifies D/N residues (Stenflo et al, 1987; Gronke et al, 1989). 

Although about one-half the size, the deduced polypeptide was organized like fibrillin- 
1 in that it consisted of a signal peptide followed by 5 structurally distinct domains, i.e., two 
domains with numerous EGF-like, EGF-CB and Fib repeats and a third with a proline-rich 
sequence (Pereira et al , 1993). However, comparison of each of these domains using the 
GAP and BESTFIT programs (Genetics Computer Group) has revealed a low level of amino 
acid homology of only 27% over the five structural domains shared by the deduced murine 
polypeptide and human fibrillin-2. These values are low for a putative fibrillin family 
member because fibrillin-1 and fibrillin-2 share —50% identity (Zhang et al, 1994). 

A search of available databases revealed that the deduced murine polypeptide was 
most similar to the human and rat latent TGF-0 binding proteins (Kanzaki et al , 1990; Tsuji 
et al , 1990). In this regard LTBP-3 was found to be similar to fibrillin in that it could also 
be divided into five structurally distinct domains (FIG. 2A, FIG. 2B, and FIG. 2C). These 
include a relatively short domain downstream of the signal peptide with a net basic charge 
(amino acids 21-33, est. pi, 11.14); a domain consisting of EGF-like, EGF-CB, TGF-bp, and 
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Fib motifs plus a proline-rich and glycine-rich sequence (amino acids 34-407); a proline-rich 
domain (amino acids 408-545); a large, domain consisting of EGF-CB, TGF-bp, and TGF- 
bp-like repeat motifs (amino acids 546-1379); and a relatively short domain at the carboxy 
terminus (amino acids 1380-1394). Amino acid sequence comparison of the deduced murine 
and human polypeptides shows 60% identity for domain #1, 52% identity for domain #2, 
30% identity for domain #3, 43% identity for domain #4, and 7% identity for domain #5. 
The average idertity over the five domains shared by the murine polypeptide and human 
LTBP was 38.4%. Significantly, cysteine residues in both polypeptide sequences were 
highly conserved. 

The fibrillins are exclusively expressed by connective cells in developing tissues 
(Zhang et al , 1994), whereas LTBP should be expressed along with TGF-j3 by both 
epithelial and connective cells (Tsuji et al , 1990). The structural homology data therefore 
predict that the murine LTBP-3 gene shown in FIG. 2B should be expressed by both 
epithelial and connective tissue cells. Tissue in situ hybridization was used to test this 
hypothesis. 

An overview of the expression pattern as determined by tissue in situ hybridization is 
presented in FIG. 3A, FIG. 3B, FIG. 3C, FIG. 3D, FIG. 3E and FIG. 3F. 

Approximate mid-sagittal sections of normal murine embryos at days 8.5-9.0, 13.5 
and 16.5 p.c. of development were hybridized with a 35 S-labeled single stranded normal 
sense riboprobe from the same cDNA construct was used. At day 8.5-9.0 of development, 
intense gene expression was observed in the mesometrial and anti-mesometrial uterine 
tissues, ectoplacental cone, placenta, placental membranes. The transcript appeared to be 
widely expressed in murine embryo mesenchymal/connective tissue compartments, including 
the facial mesenchyme, at days 8.5-9.0, 13.5 and 16.5 of development. Particularly intense 
expression of the transcript was noted in the liver. 
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Microscopy of day 8.5-9.0 embryos confirmed the widespread expression of the 
murine gene by mesenchymal cells. Significant expression of the transcript by cells of the 
developing central nervous system, somites and cardiovascular tissue (myocardium plus 
endocardium) was also observed. 

Microscopy of day 13.5 and day 16.5 embryos demonstrated expression of the murine 
gene by skeletal muscle cells and by cells involved in intramembranous and endochondral 
bone formation. The transcript was expressed by osteoblasts and by periosteal cells of the 
calvarium, mandible and maxilla. The transcript was also identified in both cartilage and 
bone of the lower extremity. A positive signal was detected in perichondrial cells and 
chondrocytes (proliferating > mature > hypertrophic) of articular cartilage, the presumptive 
growth plate, and the cartilage model within the central canal. The positive signal was also 
expressed by blood vessel endothelial cells within the mid-diaphysis, and the surrounding 
muscle cells. 

Respiratory epithelial cells lining developing small airways and connective tissue cells 
in the pulmonary interstitium expressed the murine transcript, as did myocardial cells (atria 
and ventricles) and endocardial cushion tissue. Cells within the walls of large arteries also 
expressed the transcript. Expression of the murine gene was identified in several organs of 
the alimentary system, including the tongue, esophagus, stomach, small and large intestine, 
pancreas and liver. Mucosal epithelial cells lining the upper and lower digestive tract plus 
the smooth muscle and connective tissue cells found in the submucosa expressed the 
transcript, as did acinar cells of the exocrine pancreas. Despite the high level of transcript 
expression in the liver, these results suggest both cell populations express the LTBP-3 
transcript. 

In the kidney, expression above the basal level was observed in cells of developing 
nephrons, the ureteric bud, kidney blastema and the kidney interstitium. In the skin, 
epidermal and adnexal keratinocytes, dermal connective tissue cells, and brown fat cells 
within the dorsal subcutis expressed the murine transcript. In the central and peripheral 
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nervous systems, ganglion cells within the cerebrum, brainstem, spinal cord, and peripheral 
nerves expressed the murine transcript. The transcript was also intensely expressed by cells 
of the developing murine retina. 

Thus, the murine gene is widely expressed by both epithelial and connective tissue 
cells, a pattern that would be expected for a latent TGF-j3 binding protein. Three final 
observations argue that the LTBP-like (LTBP-3) sequence presented herein is not simply the 
murine homologue of human LTBP. First, domain #4 of the murine LTBP-like (LTBP-3) 
sequence has a smaller number of EGF-like repeat motifs than human and rat LTBP (8 
versus 11). Second, portions of the human and rat LTBP-like coding sequence were 
characterized and found to share -90% identity with human and rat LTBP but only 65% 
identity with the murine LTBP-like gene. Third, the human LTBP and LTBP-like genes are 
localized to separate chromosomes. Human LTBP was assigned to human chromosome 2 
based on the analysis of human x rodent somatic cell hybrid lines (Stenman et al , 1994). 
The present invention represents the first mapping of an LTBP gene in the murine. The 
human LTBP-like genes was recently localized to chromosome 11 band ql2, while the 
murine gene was mapped to murine chromosome 19, band B (a region of conserved 
synteny), using several independent approaches, including fluorescent in situ hybridization. 

The first indication of alternative splicing came from molecular cloning studies in the 
murine, in which independent cDNA clones were isolated with a deletion of 51 bp from the 
coding sequence. PCR™/Southern blot analysis provided additional evidence that the 
homologous 51 bp sequence was alternatively spliced in normal murine embryo tissues. 

Northern blot analysis also demonstrated that the novel fibrillin gene was also 
expressed in rat callus three weeks after osteotomy, after mineralization has begun. Gene 
expression in normal adult rat bone tissue was insignificant, which suggests that microfibrils 
are an important part of the bone fracture healing response. The novel fibrillin-like gene was 
expressed in callus as a pair of alternatively spliced transcripts. This result has been 
independently reproduced on three occasions. Molecular cloning of the novel fibrillin gene 
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in both murine and rat has identified potential splice junction sites for the alternative splicing 
event. 

MC3T3-E1 murine pre-osteoblasts were used to demonstrate that the murine gene 
5 product was capable of binding TGF-jS. MC3T3-E1 cells were utilized because they 
synthesize and secrete TGF-/3, which may act as an autocrine regulator of osteoblast 
proliferation (Amarnani et al , 1993; Van Vlasselaer et al , 1994; Lopez-Casillas et al , 
1994). 

10 To determine whether or not MC3T3-E1 cells co-expressed the murine gene product 

of TGF-j8, cells were plated on 100-mm dishes under differentiating conditions (Quarles 
et al , 1992) and the medium was replaced twice weekly. Parallel dishes were plated and 
Jj assayed for cell number and alkaline phosphatase activity, which confirmed that osteoblast 
^ differentiation was indeed taking place. Equal aliquots of total cellular RNA was prepared 

if from these MC3T3-E1 cells after 5, 14 and 28 days in culture for Northern blot analysis. As 

m 

^ shown m FIG. 4, expression of the new murine gene peaked on day 14 of culture. Since 

yi MC3T3-E1 cells also show a peak in alkaline phosphatase activity on day 14 of culture 

Q (Quarles et al , 1992), the results suggest for the first time that LTBP-3 gene expression is 

m 

rv an early marker of osteoblast differentiation. 

m 

g C. DISCUSSION 

This study reports the molecular cloning of a novel LTBP-3 gene that contains 
numerous EGF-like repeats. Northern analysis indicates that the gene encodes a single 

25 transcript of -4.6 kb in murine embryo tissues. The deduced amino acid sequence of the 

murine gene product appears to be a secreted polypeptide of 1,251 amino acids. Although it 
is similar to fibrillin, the overall structural organization and expression pattern of this gene 
product most resembles LTBP, a latent TGF-/J binding protein that was originally isolated 
and characterized by Heldin and co-workers (Kanzaki et al , 1990). Several observations 

30 strongly suggest that LTBP and the murine LTBP-like gene product are therefore derived 
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from related but distinct genetic loci. First, LTBP and the LTBP-like coding sequence share 
-40% identity and differences exist in the number of EGF-CB repeats in the deduced 
polypeptide sequence of the two molecules. Second, a portion of the murine LTBP gene has 
been cloned and shown to share -90% identity with human and rat LTBP. Conversely, 
portions of the human and rat LTBP-like genes have been cloned and shown to share -90% 
identity with the murine LTBP-like gene. Third, LTBP and the LTBP-like gene reside on 
different human chromosomes (Stenman et al , 1904). Taken together, these data suggest 
that a family of at least two LTBP genes exists. 

Similarities in the structural organization of LTBP-1 and the fibrillin- 1 and fibrillin-2 
polypeptides have been noted previously (Pereira et al, 1993; Zhang et al, 1994; Taipale 
et al , 1994). For example, LTBP-1 and the fibrillins are all secreted extracellular matrix 
constituents. Moreover, each polypeptide can be organized into five domains, two of which 
consists predominantly of EGF-CB and TGF-bp repeat motifs. LTBP-1 and fibrillin- 1 also 
share a domain that is proline-rich, and LTBP possesses an 8-cysteine repeat previously 
referred to as the "Fib motif" because it was assumed to be unique to fibrillin (Pereira et al , 
1993). These similarities likely explain the initial isolation and cloning of the LTBP-3 PCR™ 
product, especially since the human oligonucleotide primers used to initially amplify murine 
cDNA were designed to direct the synthesis of an EGF-CB repeat in domain #4. 

Another point of distinction between LTBP-3 and fibrillin concerns the spacing of 
conserved cysteines C4 and C5 in EGF-like repeats. Fibrillin- 1 and fibrillin-2 each contain 
>50 such repeats, and in every one the spacing is C 4 -X-C 5 . While this pattern is repeated in 
a majority of the EGF-like repeats in LTBP-1 and LTBP-2, both genes also contain repeats 
with the spacing C 4 -X-X-C 5 . Although the significance of this observation is unclear, 
variation in the number of amino acids between C 4 and C 5 would not be expected to alter the 
function of the EGF-like repeat. Mature EGF is a 48 amino acid secreted polypeptide 
consisting of two subdomains that have few interdomain contacts (Engel, 1989; Davis, 1990). 
The larger NH 2 -terminal subdomain consists of residues 1-32 and is stabilized by a pair of 
disulfide bonds (C r C 3 and C 2 -C 4 ), whereas the smaller COOH-terminal subdomain (amino 
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acids 33-48) is stabilized by a single disulfide bond (C 5 -C 6 ). The COOH-terminal subdomain 
has a highly conserved conformation that only is possible if certain residues and the distances 
between them are well conserved, while conformation-sequence requirements for the NH2- 
terminal subdomain are relatively relaxed. Variation in C 4 -C 5 spacing would not be expected 
to alter conformation because these residues do not normally form a disulfide bond and the 
spacing variation occurs at the interface of subdomains that would not be predicted to 
interact. The Zoning of additional genes will decide whether variation in C 4 -C 5 spacing is a 
reliable discriminator between members of the LTBP and fibrillin gene families. 

The LTBP-3 gene is expressed more widely during development than fibrillin- 1 or 
fibrillin-2. Studies in developing murine tissues have shown that the Fbn-1 gene is expressed 
by mesenchymal cells of developing connective tissue, whereas the murine LTBP-like gene is 
intensely expressed by epithelial, parenchymal and stromal cells. Earlier reports have 
suggested that TGF-jS plays a role in differentiation and morphogenesis during murine 
development (Lyons and Moses, 1990), when TGF-/3 is produced by epithelial, parenchymal 
and stromal cells. Tsuji et al. , (1990) and others have suggested that the expression of TGF- 
jS binding proteins jhould mirror that of TGF-jS itself; the expression pattern of the LTBP-3 
gene over the course of murine development is consistent with this expectation. However, 
the LTBP-3 gene may not be completely co-regulated with TGF-jS. TGF-jS gene and protein 
expression during murine development has been surveyed extensively (Heine et a/., 1987; 
Lehnert and Akhurst, 1988; Pelton et al, 1989; Pelton et a/., 1990a, b; Millan et a/., 1991); 
these studies have not identified expression by skeletal muscle cells, chondrocytes, 
hepatocytes, ganglion cells, mucosal cells lining the gut, and epithelial cells of developing 
nephrons. It is conceivable that the LTBP-3 molecule has an additional function in certain 
connective tissues besides targeting TGF-/S. 

The binding properties of the LTBP-3 gene product are under investigation. 
Formally, the LTBP-3 polypeptide may bind a specific TGF-jS isoform, another member of 
the TGF-j3 superfamily (e.g., a bone morphogenetic protein, inhibin, activin, or Mullerian 
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inhibiting factor), or a growth factor unrelated to TGF-jS. Anti-peptide antibodies to the 
murine LTBP-3 polypeptide have been generated and osteoblast cell lines that express the 
molecule at relatively high levels have been identified. Studies with these reagents suggest 
that LTBP-3 assembles intracellular^ into large latent complexes with a growth factor that is 
being characterized by immunological methods. 

The presence of dibasic amino aci Is ia the LTBP-3 sequence sugge ts that it may 
undergo cell- and tissue-specific proteolysis. TGF-j8 regulates extracellular matrix 
production by suppressing matrix degradation (through a decrease in the expression of 
proteases such as collagenase, plasminogen activator, and stromelysin plus an increase in the 
expression of proteinase inhibitors such as plasminogen activator inhibitor- 1 and tissue 
inhibitor of metalloproteinase-1) and by stimulating matrix macromolecule synthesis (for 
recent review, see Lyons and Moses, 1990; Massague, 1990; Laiho and Keski-Oja, 1992; 
Miyazono et al, 1992). Conversely, production of extracellular matrix has been shown to 
down regulate TGF-j8 gene expression (Streuli et al , 1993). TGF-jS may therefore regulate 
extracellular matrix production through a sophisticated feedback loop that influences the 
expression of a relatively large number of genes. LTBP-1 and LTBP-2 may contribute to 
this regulation by facilitating the assembly and secretion of large latent growth factor 
complexes and then targeting the complex to specific connective tissues (Taipale et al , 
1994). 

If LTBP-3 is like LTBP-1, it has the potential to function as a secreted, extracellular 
structural protein. As demonstrated here, domain #1 of LTBP-3 appears to be a unique 
sequence that likely has a globular conformation. Domain #1 also is highly basic and may 
facilitate LTBP-3 binding to acidic molecules (e.g., acidic proteoglycans) within the 
extracellular space. Sequences rich in basic amino acids have also been shown to function as 
endoproteolytic processing signals for several peptide hormones (Barr, 1991; Steiner et al, 
1992). It is possible, therefore, that the NH 2 -terminus of LTBP-3 is proteolytically 
processed in a tissue-specific manner. 
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Domains #2 and #4 consist of consecutive cysteine-rich repeats, the majority of which 
are of the EGF-CB type. Besides binding calcium (Corson et al, 1993), these repeats may 
provide LTBP-3 with regions conformation capable of interacting with other matrix 
macromolecules (Engel, 1989). Domain #3 is proline rich and may be capable of bending 
(or functioning like a hinge) in three-dimensional space (MacArthur and Thornton, 1991). 
(In this regard, domain #2 is of interest because it has a similar stretch of 135 amino acids 
that is both proLae- and glycine-rich. Since glycine-rich sequences are also thought to be 
capable of bending or functioning like a hinge in three-dimensional space, this amino acid 
sequence may interrupt the extended conformation of domain #2, thereby providing it with a 
certain degree of flexibility in three-dimensional space.) 

Domain #5 also appears to be a unique sequence having a globular conformation. 
The absence of a known cell attachment motif may indicate that, in contrast to LTBP-1, the 
LTBP-3 molecule may have a more limited role in the extracellular matrix (i.e. , that of a 
structural protein) in addition to its ability to target latent TGF-jS complexes to specific 
connective tissues. 

MC3T3-E1 pre-osteoblasts co-express LTBP-3 and TGF-/31 and these proteins form a 
complex in the culture medium. These results are particularly interesting because bone 
represents one of the largest known repositories of latent TGF-/3 (200 ^g/kg bone; Seyedin 
et al. , 1986 and 1987), and because this growth factor plays a critical role in the 
determination of bone structure and function. For example, TGF-j8 is thought to (i) provide 
a powerful stimulus to bone formation in developing tissues, (ii) function as a possible 
"coupling factor" during bone remodeling (a process that coordinates bone resorption and 
formation), and (iii) exert a powerful bone inductive stimulus following fracture. Activation 
of the latent complex may be an important step governing TGF-0 effects, and LTBP may 
modulate the activation process (e.g., it may "protect" small latent complexes from 
proteolytic attack). 



Expression of large latent TGF-0 complexes bearing LTBP may be physiologically 
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relevant to, i.e. , may be part of the mechanism of, the pre-osteoblast -> osteoblast 
differentiation cascade. This is based on the evidence that MC3T3-E1 cells express large 
latent TGF-/31 complexes bearing LTBP-2 precisely at the time of transition from the 
pre-osteoblast to osteoblast phenotype (-day 14 in culture, or, at the onset of alkaline 
phosphatase expression; see Quarles et al, 1992). The organ culture model, for example, 
likely is comprised of differentiated osteoblasts but few bond progenitors, making it a 
difficult model i, best in which to study the differentiation cascade (Dallas et al. , 1984). It 
is also well known that MG63, ROS17/2.8 and UMR 106 cells are rapidly dividing and they 
express the osteoblast phenotype. Thus, these osteoblast-like cell lines do not show the 
uncoupling of cell proliferation and cell differentiation that characterizes the normal 
(physiologically relevant) pre-osteoblast -* osteoblast transition (Gerstenfeld et al. , 1984; 
Stein and Lian, 1993). Therefore, the production of small versus large latent TGF-/? 
complexes may be associated with specific stages in the maturation of bone cells. 

LTBP-3 may bind calcium, since EGF-CB repeats have been shown to mediate high 
affinity calcium binding in LTBP-1 and other proteins (Colosetti et al., 1993). Calcium 
binding, in turn, may contribute to molecular conformation and the regulation of its 
interactions with other molecules. The presence of dibasic amino acids suggests that LTBP-3 
may also undergo cell- and tissue-specific proteolysis. TGF-0 regulates extracellular matrix 
production by suppressing matrix degradation (through a decrease in the expression of 
proteases such as collagenase, plasminogen activator, and stromelysin plus an increase in the 
expression of proteinase inhibitors such as plasminogen activator inhibitor- 1 and tissue 
inhibitor of metalloproteinase-1) and by stimulating matrix macromolecule synthesis (for 
recent reviews, see Lyons and Moses, 1990; Massague, 1990; Laiho and Keski-Oja, 1992; 
and Miyanzono et al. , 1993). Conversely, production of extracellular matrix has been shown 
to down regulate TGF-j8 gene expression (Streuli et al., 1993). TGF-/3 may therefore 
regulate extracellular matrix production through a sophisticated feedback loop that influences 
the expression of a relatively large number of genes. LTBP-1, LTBP-2, and LTBP-3 may 
contribute to this regulation by facilitating the assembly and secretion of large latent growth 
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factor complexes and then targeting the complex to specific connective tissues (Taipale et al , 
1994). 

EXAMPLE H 
PREPARATION OF LTBP-3 ANTIBODIES 

An affinity-purified antibody (#274) capable of immunoprecipitating was prepared 
against the murine LTBP-3 gene product. Full-length murine cDNAs were assembled into 
the pcDNA3 mammalian expression vector (Invitrogen) and expressed following transient 
transfection of 293T cells. Nascent polypeptides, radiolabeled by addition of 35 S Cys to the 
medium of transfected cells, were immunoprecipitated using affinity-purified antibody #274. 
As shown in FIG. 5, the new murine polypeptide was estimated to be 180-190 kDa. To 
ensure the specificity of #274 binding, preincubation with 10 jug of synthetic peptide was 
shown to block immunoprecipitation of the 180-190 kDa band. 

Finally, MC3T3-E1 cells were cultured for 7 days under differentiating conditions and 
double-labeled with 30 juCi/ml 35 S cysteine and 35 S methionine in deficient media. 
Radiolabeled media was dialyzed into cold PBS with protease inhibitors. Aliquots of the 
dialyzed medium sample (10 6 incorporated CPM) were analyzed by a combined 
immunoprecipitation/ Western analysis protocol. The murine polypeptide was clearly and 
reproducibly secreted by MC3T3-E1 cells, migrating under reducing conditions as a single 
band of 180-190 kDa (FIG. 6). Consistent with the results of previous studies {e.g. , 
Miyazono et al, 1988; Dallas et al, 1994; Moren et al, 1994), bands of 70 and 50 kDa 
corresponding to the TGF-/31 precursor were co-immunoprecipitated with the 180 kDa 
LTBP-3 protein. Weak bands of 40 and 12 kDa were also identified in experiments in which 
only immunoprecipitation was performed. The latter were not included in FIG. 6 because 
they migrated within that portion of the gel included in the Western analysis. Protein bands 
of 70-12.5 kDa are not variant forms of LTBP-3; FIG. 5 demonstrates that LTBP-3 migrates 
as a single band of 180-190 kDa following transient transfection of 293T cells, which fail to 
make TGF-/?1. By immunoprecipitation, a unique band consistent with monomeric mature 
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TGF-jSl was found in the LTBP-2 immunoprecipitate. Antibody #274 is incapable of 
binding TGF-/31 as determined by radioimmunoassay using commercially available reagents 
(R&D Systems) and the manufacturer's suggested protocols. These results have been 
reproduced in 6 independent experiments which utilized 3 separate lots of MC3T3-E1 
medium. Thus the new murine LTBP-3 polypeptide binds TGF-jS in vitro. 

In co-tram 'action studies of 293T Cells using pLTBP-3fl and pTGF-/31, 
immunoprecipitation of LTBP-3 and TGF-/31 was demonstrated by 293T cells following 
transient transfection and radiolabeling. Aliquots (~ 10 6 incorporated CPM) of radiolabeled 
media were immunoprecipitated and separated using 4%-18% gradient ST^S-PAGE and either 
reducing or nonreducing conditions as described (Yin et al., 1995) OFIG. 13). 

EXAMPLE ffl 
ISOLATION OF A GENE ENCODING MURINE LTBP-2 

In addition to determining the DNA and corresponding polypeptide sequence of the 
murine LTBP-3 gene, the murine LTBP-2 gene was also cloned and sequenced. 

1. Cloning and DNA sequencing of LTBP-2 

To identify new murine LTBP family members, independently designed degenerate 
oligonucleotide primers were synthesized based on a structural homology shared by human 
LTBP-1 and mouse LTBP-3 coding sequences: forward primer, 

5 * - AAACGTC AC ACGTGAI ACGTGAACGTTGCTTGCTGG-3 ? (SEQ ID NO: 12); reverse 
primer, 5 '-TTACGTCCACGT AC ACGTCTAGC AAGCAAGC A-3 ' (SEQ ID NO: 13), and 
then used PCR™ to amplify single-stranded mouse embryo cDNA prepared from normal CD- 
1 mouse embryo mRNA. A band of approximately 400 basepairs (bp) was isolated and 
purified by agarose gel electrophoresis, the DNA was ligated into the TA cloning vector 
(InVitrogen), and the ligation mixture was used to transform competent bacteria. Plasmid 
DNA (from 28 colony forming units) was prepared and evaluated by DNA sequence analysis. 
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As determined by sequence identity comparison, 16/28 plasmid DNAs coded for mouse 
LTBP-1, 11/28 coded for mouse LTBP-3, and 1/28 coded for an apparently unique sequence. 
The insert DNA from the unique plasmid was then used as a probe to screen a cDNA library 
prepared from 3T3 cells (Stratagene, Inc.). A walking strategy eventually yieldedJhe- 
overlapping cDNA clones shown in /FIG. 1 J'. Analysis of these clones identified an open 
reading frame of 5,430 base pairs. Coifiparison of sequence identity using the GAP and 
BESTFIT programs (Genetics Computer Group) revealed 79.7% identity between the mouse 
open reading frame and human LTBP-2 (Centrella et al, 1991), but j< 47.1% identity 
between the mouse open reading frame and human LTBP-1 and mouse LTBP-3. The 
sequence comparison data agreed with chromosomal localization data, which collectively 
established that the sequence was the mouse homolog of human LTBP-2. 

The level of amino acid sequence identity (approximately 40%) among the LTBP-1, - 
2, and -3 polypeptides is in the range observed for other protein isoforms that contain 
multiple EGF-like repeats, like the fibrillins (Yin et al , 1995) and the diverse laminin chains 
(Engel, 1989). 

An LTBP-2 methionine codon in a favorable context for translation initiation was 
provisionally designated the translation start site (see Kozak, 1991). The deduced initiator 
methionine was followed by a signal sequence of approximately 35 amino acids. Consistent 
with the structure and length of the human LTBP-2 signal peptide (Centrella et al, 1991), 
the 15 residues immediately downstream of the mouse LTBP-2 initiator methionine were 
largely hydrophilic in nature, whereas amino acids 16-35 represented a typical hydrophobic 
signal peptide sequence. The small neutral amino acid residues Ser (-3) and Ala (-1) and the 
large polar residue Gin ( + 1) appeared to define the signal peptide cleavage site (von Heijne, 
1983). 

In contrast to LTBP-1 and -3, which appear to be organized into 5 structurally 
distinct domains downstream of the signal peptide (Yin et al , 1995), the deduced mouse 
LTBP-2 polypeptide consists of ten alternating structural domains that are composed of either 
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proline- and glycine-rich sequences or cysteine-rich repeat motifs. Thus, domain 1 (amino 
acids 36-160) was composed of 19.4% glycine and proline residues. Domain 2 (amino acids 
161-213) consisted of 2 EGF-like repeats. Domain 3 (amino acids 214-344) was composed 
of 223% glycine and proline residues. Domain 4 (amino acids 345-413) consisted of 2 
cysteine-rich repeats. Domain 5 (amino acids 414-536) was composed of 19.5% glycine and 
proline residues. Domain 6 (amino acids 537-708) consisted of 3 cysteine-rich repeats; based 
on structural homologies, the first repeat was a Fib motif (Pereira et al , 1993; a copy of this 
reference has been included in the Appendix), the second was an epidermal growth factor- 
calcium binding (EGF-CB) motif (Handford et al , 1990), and the third was a transforming 
growth factor-/31-binding protein (TGF-bp) motif (Kanzaki et al , 1990). Domain 7 (amino 
acids 709-831) was composed of 20.3% proline and glycine residues. Domain 8 (amino 
acids 832-1626) consisted of 15 EGF-like repeats and 2 TGF-bp repeats. Domain 9 (amino 
acids 1627-1721) was composed of 29.5% glycine and proline residues. Domain 10 (amino 
acids 1722-1810) consisted of 2 EGF-like repeat motifs. 

The conceptual mouse LTBP-2 amino acid sequence consists of 1,810 amino acids, 
with an estimated jpl of 5.02, a predicted molecular mass of 197,917 Da., and eight potential 
iV-linked glycosylation sites. Similar to the mouse LTBP-3 polypeptide, RGD and laminin 
B2 chain cell adhesion sequences were not identified. Altogether, 26 cysteine-rich repeats 
were found in the mouse LTBP-2 polypeptide. As described above, 20/26 were 
characterized by the presence of 6 cysteine residues and therefore were EGF-like. 12/20 
showed the general consensus D/N-I/V-D/N-E/D-Q, derived from an analysis of 154 EGF- 
like repeats in 23 different proteins and from structural analysis of the coagulation factor X 
EGF-CB repeat, both bound and unbound to calcium ion (Selander-Sunnerhagen et al , 
1992). Variations on the consensus have been noted previously (for examples see Yin et al , 
1995; Yin et al, 1995), and two of these, were identified in mouse LTBP-2. Two potential 
variants which have not previously been reported were also identified (D-A-D-E-Q and D-H- 
N-E-CJ, giving a total of 16 putative EGF-CB repeats. All 16 repeats also contained a 
proposed recognition sequence (C 3 -X-D/N-X-X-X-X-Y/F-X-C 4 ) for an Asp/Asn hydroxylase 
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that co-and posttranslationally modifies D/N residues (Stenflo et ah , 1987; Gronke et ah , 
1989). Previous NMR studies of the isolated first EGF-like domain of human factor IX 
indicate that 3 residues derived from the general consensus and from the recognition 
sequence are direct ligands for calcium (Handford et ah, 1990; 1995; 1991). This has led to 
a proposed calcium-binding consensus D/N, D/N, D*/N* (where * denotes ajS-hydroxylated 
residue). A fourth residue in the consensus as originally proposed, F/Y, is now known not 
to be a direct ligand for calcium (Hughes et ah, 1993). The three amino acids that are direct 
ligands for calcium in factor IX are conserved in each of the 16 putative EGF-CB repeats 
identified in mouse LTBP-2. 

Four of five of the putative domains rich in proline and glycine were also rich in 
basic amino acid residues (domain 1, 15.5%; domain 3, 13.1%; domain 5, 11%; domain 7, 
8.1%; and domain 9, 3.2%). Co-existence of proline and basic amino acids suggests the 
possibility that LTBP-2 undergoes 'proline-directed' endoproteolytic processing (Devi, 1991). 
Indeed, both monobasic and dibasic cleavage motifs — e.g., R-R and R-X-X-R, respectively 
(Barr, 1991)— were identified in all five postulated proline- and glycine-rich structural 
domains. In several instances, monobasic cleavage motifs occurred near or within potential 
dibasic cleavage motifs (e.g. , Arg 108, Arg 286, Arg 429, and Lys 727). One potential 
monobasic cleavage motif was identified in each of the 5 proline- and glycine-rich domains. 
Dibasic cleavage motifs in general were more prevalent near the deduced LTBP-2 amino 
terminus. Endoproteolytic cleavage of LTBP is of potential interest because it may help 
explain the smaller than expected size of platelet LTBP-1. 

The complete cDNA nucleotide sequence for murine LTBP-2 is shown in (SEQ ID 
NO:l) (FIG. 7). /The deduced amino acid sequence is shown in (SEQ ID NO:2) (FIG. 8). 

2. Mouse ltbp-2 Gene Expression in Developing Perichondrium 

The inventors have demonstrated that ltbp-3 is widely and intensely expressed in both 
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developing maternal tissues (e.g., uterine decidua) and mouse embryo tissues (e.g., 
mesenchyme, connective tissue, epithelia, and parenchyma). Tissue in situ hybridization was 
used to compare and contrast the developmental expression of lthp-2 and Ubp-3 { FIG. 12A. . 
FIG. 12B, and FIG 12C present an overview of ltbp-2 expression in a mid-sagittahscctibn of 
a mouse embryo at day 16.5 p.c. of development, when expression is strongest. The section 
was hybridized with a 35 S-labeled single stranded antisense riboprobe synthesized from a 580 
base pair cDNA coding for the mouse LTBP-2 3' u translated region. The probe showed 
<30% sequence identity with the 3' untranslated sequences of human ltbp~l and ltbp-2, 
which is too low to give spurious hybridization signals under our conditions. A 35 S-labeled 
single stranded normal sense riboprobe from the same cDNA construct was used as a 
negative control, ltbp-2 expression above background was observed in the snout, base of the 
skull, tail, paw, lung, vertebrae, and large vessels of mouse embryos. Microscopy of day 
16.5 p.c. embryo tissue sections, taken from the same slide used to prepare whole mount 
sections shown in £IG. 12 A, demonstrated that the pattern of hybridization was due to 
significant ltbp-2 geiie expression by perichondrial and vascular wall cells. Positive signals 
were detected, for example, in perichondrial cells of cartilage aggregates tocated in the 
vertebral column (v), forelimb and tail, and at the base of the skull (FIG. 12B). Indeed, the 
perichondrium (pc) of all cartilage aggregates observed in these motrse embryo tissue sections 
was positively hybridized, ltbp-2 was also expressed by vascular wall cells of the aorta (ao), 
and in blood vessels within lung parenchyma and within the connective tissue supporting hair 
follicle structures associated with the snout (s). In contrast, ltbp-2 was expressed at 
insignificant levels (i.e., below the experimental background) in the generalized 
mesenchyme/connective tissue, brain, peripheral nerve, tooth rudiment, lung epithelium, 
cardiac and skeletal muscle, gut epithelium, liver parenchyma, pancreas epithelium and islets 
of Langerhans, brown fat cells, and kidney parenchyma. These in situ hybridization results, 
which were reproduced using independent tissue sections, demonstrate for the first time that 
ltbp-2 expression in developing mouse tissues is more restricted than that of ltbp-3. 

3. Chromosomal Localization of Mouse ltbp-2 Gene 
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The murine ltbp-2 gene was assigned to a mouse chromosome by PCR™ analysis of 
genomic DNA from a mapping panel consisting of 19 mouse x Chinese hamster and 1 mouse 
x rat somatic cell hybrid lines as described (Li et al , 1995). A PCR™ product of the 
expected size (600 bp) was obtained from hybrid cells that had retained mouse chromosome 
12. All other mouse chromosomes (except chromosome 12) were excluded by at least four 
discordant hybrids. Fluorescent in situ hybridization using murine ltbp-2 genomic and cDNA 
probes generate^ identical results that localized the ltbp-2 gene to mouse chromosome 12, 
band D. 14/20 metaphase spreads analyzed exhibited a fluorescent signal on both chromatids 
of chromosome 12 at the band D site, and 10/12 had signals on both chromosome 12 
homologs. No specific signals were seen on other chromosomes. This region in the mouse 
is a conserved syntenic region with human chromosome 14, band 14q24, the site of the 
human LTBP-2 genetic locus (Moren et al , 1994), thereby providing strong support for the 
notion that the murine ltbp-2 gene is the true homolog of human LTBP-2. 

EXAMPLE IV 
EXPRESSION OF RECOMBINANT LTBP PROTEIN 

The Pichia Expression Kit (Invitrogen, Inc.) may be used to prepare recombinant 
LTBP protein. This kit, based on the methylotrophic yeast, Pichia pastoris, allows high- 
level expression of recombinant protein in an easy-to-use relatively inexpensive system. In 
the absence of the preferred carbon source, glucose, P. pastoris utilizes methanol as a carbon 
source. The AOX1 promoter controls the gene that codes for the expression of the enzyme 
alcohol oxidase, which catalyzes the first step in the metabolism of methanol. This 
promoter, which is induced by methanol, has been characterized and incorporated into a 
series of Pichia expression vectors. This feature of Pichia has been exploited to express high 
levels of recombinant proteins often in the range of grams per liter. Because it is eukaryotic, 
P. pastoris utilizes posttranslational modification pathways that are similar to those used by 
mammalian cells. This implies that the recombinant LTBP-2 or LTBP-3 protein will be 
glycosylated and will contain disulfide bonds. 
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For preparation of a recombinant LTBP-2 or LTBP-3 protein, the native LTBP-2 or 
LTBP-3 cDNA is modified by the addition of a commercially available epitope tag (the HA 
epitope, Pharmacia, LKB Biotechnology, Inc.). Such fragments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, by application of nucleic 
acid reproduction technology, such as the PCR™ technology of U.S. Patent 4,683,202 (herein 
incorporated by reference) or by introducing selected sequences into recombinant vectors for 
recombinant production. (PCR™ is a regi tered trademark of Hoffmann-Le loche, Inc.). This 
is followed by cloning into the Pichia expression vector. The resulting plasmid is 
characterized by DNA sequence analysis, linearized by digestion with NotI, and spheroplasts 
will be prepared and transformed with the linearized construct according to the 
manufacturer's recommendations. 

Transformation facilitates a recombination event in vivo between the 5' and 3* AOX1 
sequences in the Pichia vector and those in the Pichia genome. The result is the replacement 
of AOX1 with the gene of interest. 

Transformants are then plated on histidine-deficient media, which will select for 
successfully transformed cells. Transformants are further selected against slow growth on 
growth media containing methanol. Positive transformants are grown for 2 days in liquid 
culture and then for 2-6 days in broth that uses methanol as the sole carbon source. Protein 
expression is evaluated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS- 
PAGE) and Western hybridization using a commercially available polyclonal antisera to the 
HA epitope (Pharmacia). Recombinant LTBP-2 or LTBP-3 protein may be purified 
according to the manufacturer's recommendations, dialyzed against double distilled, 
deionized water and lyophilized in 10 mg aliquots. 
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All of the compositions and methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to 
the composition, methods and in the steps or in the sequence of steps of the method 
described herein without departing from the concept, spirit and scope of the invention. More 
specifically, it will be apparent that certain agents which are both chemically and 
physiologically related may be substituted for the agents described herein while the same or 
similar results would be achieved. All such similar substitutes and modifications apparent to 
those skilled in the art are deemed to be within the spirit, scope and concept of the invention 
as defined by the appended claims. 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(ix) FEATURE : 

(A) NAME/KEY: CDS 
10 (B) LOCATION: 1..5499 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG GAG AGC ACC TCC CCG CGA GGT CTC CGG TGC CCA CAG CTC TGC AGC 48 
15 Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gin Leu Cys Ser 
15 10 15 

CAC TCT GGC GCC ATG AGA GCG CCG ACC ACC GCT CGC TGC TCC GGA TGC 96 
His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys 
20 20 25 30 

ATC CAA CGG GTG CGT TGG AGG GGC TTC CTG CCA CTT GTC CTG GCT GTC 144 
lie Gin Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val 
35 40 45 

25 

U TTG ATG GGG ACA AGT CAT GCC CAA CGG GAT TCC ATA GGG AGA TAC GAA 192 

y!3 Leu Met Gly Thr Ser His Ala Gin Arg Asp Ser lie Gly Arg Tyr Glu 

|H 50 55 60 

g0 CCA GCT AGC AGG GAT GCG AAT CGG TTG TGG CAC CCC GTG GGC AGC CAC 24 0 

fy Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His 

gl 65 70 75 80 

ft' CCC GCA GCG GCT GCA GCC AAG GTG TAC AGT CTG TTC CGA GAG CCT GAC 28 8 

N35 Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp 
* - ~ ~ 85 ' 90 95 

^ GCG CCG GTC CCC GGC TTG TCG CCC TCT GAG TGG AAC CAG CCG GCC CAG 336 

Y! Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gin Pro Ala Gin 

,NK) 100 105 110 

ffl GGG AAC CCG GGA TGG CTC GCA GAG GCC GAG GCC AGG AGG CCA CCT CGA 384 

Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg 
115 120 125 



45 



ACC CAG CAG CTG CGT CGA GTC CAG CCA CCT GTC CAG ACT CGG AGA AGC 432 
Thr Gin Gin Leu Arg Arg Val Gin Pro Pro Val Gin Thr Arg Arg Ser 
130 135 140 



50 CAT CCC CGG GGC CAG CAG CAG ATA GCA GCC CGG GCT GCA CCT TCT GTC 480 

His Pro Arg Gly Gin Gin Gin lie Ala Ala Arg Ala Ala Pro Ser Val 
145 150 155 160 

GCG CGC CTG GAA ACC CCT CAG CGA CCC GCG GCT GCA CGG CGA GGG CGG 528 
55 Ala Arg Leu Glu Thr Pro Gin Arg Pro Ala Ala Ala Arg Arg Gly Arg 

165 170 175 

CTC ACT GGG AGA AAT GTC TGC GGG GGA CAG TGC TGC CCA GGA TGG ACA 576 
Leu Thr Gly Arg Asn Val Cys Gly Gly Gin Cys Cys Pro Gly Trp Thr 
60 180 185 190 

ACA TCA AAC AGC ACC AAC CAC TGT ATC AAA CCT GTG TGT CAG CCT CCC 624 
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Thr Ser Asn Ser Thr Asn His Cys lie Lys Pro Val Cys Gin Pro Pro 
195 200 205 

TGT CAG AAC CGA GGC TCC TGC AGC AGG CCC CAG GTC TGC ATC TGC CGT 
Cys Gin Asn Arg Gly Ser Cys Ser Arg Pro Gin Val Cys lie Cys Arg 
210 215 220 

TCT GGC TTC CGT GGG GCG CGC TGT GAG GAG GTC ATC CCT GAG GAG GAA 
Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val He Pro Glu Glu Glu 
225 230 235 240 

TTT GAC CCT CAG AAT GCC AGG CCT GTG CCC AGA CGC TCA GTG GAG AGA 
Phe Asp Pro Gin Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg 
245 250 255 

GCA CCC GGT CCT CAC AGA AGC AGT GAG GCC AGA GGA AGT CTA GTG ACC 
Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr 
260 265 270 

AGA ATA CAG CCG CTG GTA CCA CCA CCA TCA CCA CCT CCA TCT CGG CGC 
Arg He Gin Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg 
275 280 285 

CTC AGC CAG CCC TGG CCC CTG CAG CAG CAC TCA GGG CCG TCC AGG ACA 
Leu Ser Gin Pro Trp Pro Leu Gin Gin His Ser Gly Pro Ser Arg Thr 
290 295 300 

GTT CGT CGG TAT CCG GCC ACT GGT GCC AAT GGC CAG CTG ATG TCC AAC 
Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gin Leu Met Ser Asn 
305 310 315 320 

GCT TTG CCT TCA GGA CTC GAG CTG AGA GAC AGC AGC CCA CAG GCA GCA 
Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gin Ala Ala 
325 330 335 

CAT GTG AAC >CAT"CTC TCA CCC~CCC TGG GGG CTG AAC CTC ACC GAG AAA 
His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys 
340 345 350 

ATC AAG AAA ATC AAA GTC GTC TTC ACC CCC ACC ATC TGC AAG CAG ACC 
He Lys Lys He Lys Val Val Phe Thr Pro Thr He Cys Lys Gin Thr 
355 360 365 

TGT GCC CGG GGA CGC TGT GCC AAC AGC TGT GAG AAG GGT GAC ACC ACC 
Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr 
370 375 380 

ACC TTG TAC AGT CAG GGT GGC CAT GGG CAT GAC CCC AAG TCT GGC TTC 
Thr Leu Tyr Ser Gin Gly Gly His Gly His Asp Pro Lys Ser Gly Phe 
385 390 395 400 

CGT ATC TAT TTC TGC CAA ATC CCC TGC CTG AAT GGT GGC CGC TGC ATC 
Arg He Tyr Phe Cys Gin He Pro Cys Leu Asn Gly Gly Arg Cys He 
405 410 415 

GGC CGG GAC GAG TGC TGG TGT CCA GCC AAC TCC ACA GGA AAG TTC TGC 
Gly Arg Asp Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys 
420 425 430 

CAT CTG CCT GTC CCG CAG CCA GAC AGG GAA CCT GCA GGG CGA GGT TCC 
His Leu Pro Val Pro Gin Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser 
435 440 445 
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10 



690 695 700 

TGC AGC CGT GTG GGC AAA GCC TGG GGT AGC ACA TGT GAA CAG TGT CCC 2160 
Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gin Cys Pro 
705 710 715 720 

CTG CCT GGC ACA GAA GCC TTC AGG GAG ATC TGC CCT GCT GGC CAT GGC 2208 
Leu Pro Gly Thr Glu Ala Phe Arg Glu He Cys Pro Ala Gly His Gly 
725 730 735 

TAC ACC TAC TCG AGC TCA GAC ATC CGC CTG TCT ATG AGG AAA GCC GAA 2256 
Tyr Thr Tyr Ser Ser Ser Asp He Arg Leu Ser Met Arg Lys Ala Glu 
^40 745 750 

15 GAA GAG GAA CTG GCT AGC CCC TTA AGG GAG CAG ACA GAG CAG AGC ACT 2304 

Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gin Thr Glu Gin Ser Thr 
755 760 765 

GCA CCC CCA CCT GGG CAA GCA GAG AGG CAA CCA CTC CGG GCA GCC ACC 23 52 

20 Ala Pro Pro Pro Gly Gin Ala Glu Arg Gin Pro Leu Arg Ala Ala Thr 
770 775 780 

GCC ACC TGG ATT GAG GCT GAG ACC CTC CCT GAC AAA GGT GAC TCT CGG 2400 
Ala Thr Trp He Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg 
f 25 785 790 795 800 

f2 GCT GTT CAG ATC ACA ACC AGT GCT CCC CAC CTA CCT GCC CGG GTA CCA 2448 

111 Ala Val Gin He Thr Thr Ser Ala Pro His Leu Pro Ala Arg Val Pro 

%fl 805 810 815 

y;: ggg gat gcc act gga aga cca gca cca tcc ttg cct gga cag ggc att 2496 

U'i Gly Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gin Gly He 

08 820 825 830 

7. r : 

^35 CCA GAG AGT £CA_GCA GAA GAO^QAA GTG ATT CCC TCC AGT GAT GTC TTG 2 544 

Pro Glu Ser Pro Ala Glu Glu Gin Val He Pro Ser Ser Asp Val Leu 
S 835 840 845 



^ GTG ACA CAC AGC CCC CCA GAC TTT GAT CCA TGT TTT GCT GGA GCC TCC 2592 

^40 Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser 
850 855 860 



rti 



•SJSSS' 



O ATC TGT GGC CCT GGG ACC TGT GTG AGC CTC CCA AAT GGA TAC AGA 2640 

Asn He Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg 
45 865 870 875 880 



50 



TGT GTC TGC AGC CCT GGC TAC CAG CTA CAC CCC AGC CAA GAC TAC TGT 2688 
Cys Val Cys Ser Pro Gly Tyr Gin Leu His Pro Ser Gin Asp Tyr Cys 
885 890 895 

ACT GAT GAC AAC GAG TGT ATG AGG AAC CCC TGT GAA GGA AGA GGG CGC 273 6 

Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg 
900 905 910 



55 TGT GTC AAC AGT GTG GGC TCC TAC TCC TGC CTC TGC TAT CCT GGC TAC 2 784 

Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr 
915 920 925 

ACA CTA GTC ACC CTC GGA GAC ACA CAG GAG TGC CAA GAT ATC GAT GAG 2832 
60 Thr Leu Val Thr Leu Gly Asp Thr Gin Glu Cys Gin Asp He Asp Glu 

930 935 940 
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TGT GAG CAG CCC GGG GTG TGC AGT GGT GGG CGA TGC AGC AAC ACG GAG 2 880 

Cys Glu Gin Pro Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu 
945 950 955 960 

5 GGC TCG TAG CAC TGC GAG TGT GAT CGG GGC TAC ATC ATG GTC AGG AAA 2928 

Gly Ser Tyr His Cys Glu Cys Asp Arg Gly Tyr lie Met Val Arg Lys 
965 970 975 

GGA CAC TGT CAA GAT ATC AAC GAA TGC CGT CAC CCT GGT ACC TGC CCT 2 976 

10 Gly His Cys Gin Asp lie Asn Glu Cys Arg His Pro Gly Thr Cys Pro 

980 985 990 

GAT GGG AGA TGC GTC AAC TCC CCT GGC TCC TAC ACT TGT CTG GCC TG m 3 024 

Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys 
15 995 1000 1005 

GAG GAG GGC TAT GTA GGC CAG AGT GGG AGC TGT GTA GAT GTC AAT GAG 3072 

Glu Glu Gly Tyr Val Gly Gin Ser Gly Ser Cys Val Asp Val Asn Glu 
1010 1015 1020 

20 

TGT CTG ACC CCT GGG ATA TGT ACC CAT GGA AGG TGC ATC AAC ATG GAA 3120 

Cys Leu Thr Pro Gly lie Cys Thr His Gly Arg Cys lie Asn Met Glu 
1025 1030 1035 1040 

25 GGC TCC TTT AGA TGC TCC TGT GAG CCG GGC TAT GAG GTC ACC CCA GAC 3168 

"I' Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp 

S 1045 1050 1055 

J% AAG AAG GGC TGC CGA GAT GTG GAC GAG TGT GCC AGC CGA GCC TCG TGC 3216 

3»D Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys 

— 1060 1065 1070 

fp CCC ACG GGC CTC TGC CTC AAC ACG GAG GGC TCC TTC ACC TGC TCA GCC 3264 

\Pi Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala 

45 1 P 75 , _ . K 1080 1085 

O TGT CAG AGC GGG TAC TGG GTG AAC GAA GAT GGC ACT GCC TGT GAA GAC 3312 

ff: Cys Gin Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr Ala Cys Glu Asp 

TJ 1090 1095 1100 
IP 

\y TTG GAT GAA TGT GCC TTC CCT GGA GTC TGC CCC ACA GGC GTC TGC ACC 3360 

O Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr 

□ 1105 1110 1115 1120 

45 AAT ACT GTA GGC TCC TTC TCC TGC AAG GAC TGT GAC CAG GGC TAC CGG 3408 

Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gin Gly Tyr Arg 
1125 1130 1135 

CCC AAC CCC CTG GGC AAC AGA TGC GAA GAT GTG GAT GAG TGT GAA GGT 3456 
50 Pro Asn Pro Leu Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly 

1140 1145 1150 

CCC CAA AGC AGC TGC CGG GGA GGC GAA TGC AAG AAC ACA GAA GGT TCC 3504 
Pro Gin Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser 
55 1155 1160 1165 

TAC CAA TGC CTC TGT CAC CAG GGC TTC CAG CTG GTC AAT GGC ACC ATG 35 52 

Tyr Gin Cys Leu Cys His Gin Gly Phe Gin Leu Val Asn Gly Thr Met 
1170 1175 1180 



60 



TGT GAG GAC GTG AAT GAG TGT GTT GGG GAA GAG CAT TGT GCT CCT CAC 3600 
Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His Cys Ala Pro His 
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1185 



1190 



1195 



1200 
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CAG AAC TCC ACA CAG GCC GAG TGC TGC TGC ACT CAG GGT GCC AGA TGG 4368 
Gin Asn Ser Thr Gin Ala Glu Cys Cys Cys Thr Gin Gly Ala Arg Trp 
1445 1450 1455 

5 GGA AAG GCC TGT GCG CCC TGC CCA TCT GAG GAC TCA GTT GAA TTC AGT 4416 

Gly Lys Ala Cys Ala Pro Cys Pro Ser Glu Asp Ser Val Glu Phe Ser 
1460 1465 1470 

CAG CTC TGC CCC AGT GGT CAA GGT TAC ATC CCA GTG GAA GGA GCC TGG 4464 
10 Gin Leu Cys Pro Ser Gly Gin Gly Tyr lie Pro Val Glu Gly Ala Trp 
1475 1480 1485 

ACA TTT GGA CA* ACC ATG TAT ACA GAT GCC GAT GAA TGT GTA CTG TTT 4512 
Thr Phe Gly Gin Thr Met Tyr Thr Asp Ala Asp Glu Cys Val Leu Phe 
15 1490 1495 1500 

GGG CCT GCT CTC TGC CAG AAT GGC CGA TGC TCA AAC ATA GTG CCT GGC 4560 
Gly Pro Ala Leu Cys Gin Asn Gly Arg Cys Ser Asn He Val Pro Gly 
1505 1510 1515 1520 



20 



60 



TAC ATT TGC CTG TGC AAC CCT GGC TAC CAC TAT GAT GCC TCC AGC AGG 4608 
Tyr He Cys Leu Cys Asn Pro Gly Tyr His Tyr Asp Ala Ser Ser Arg 
1525 1530 1535 



f§5 AAG TGC CAG GAT CAC AAC GAA TGC CAG GAC TTG GCC TGT GAG AAC GGT 4656 

'% Lys Cys Gin Asp His Asn Glu Cys Gin Asp Leu Ala Cys Glu Asn Gly 
^ 1540 1545 1550 

HI 

J! GAG TGT GTG AAC CAA GAA GGC TCC TTC CAT TGC CTC TGC AAT CCC CCC 4704 

r30 Glu Cys Val Asn Gin Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro 
1H 1555 1560 1565 

pi 

£0 CTC ACC CTA GAC CTC AGT GGG CAG CGC TGT GTG AAC ACG ACC AGC AGC 4752 

m Leu Thr Leu Asp Leu Ser Gly Gin Arg Cys Val Asn Thr Thr Ser Ser 
J35 1570 _ ^ _ 15 ZL- 1580 

Q ACG GAG GAC TTC CCT GAC CAT GAC ATC CAC ATG GAC ATC TGC TGG AAA 4800 

Si Thr Glu Asp Phe Pro Asp His Asp He His Met Asp He Cys Trp Lys 

is 1585 1590 1595 1600 



AAA GTC ACC AAT GAT GTG TGC AGC CAG CCC TTG CGT GGG CAC CAT ACC 4848 
Lys Val Thr Asn Asp Val Cys Ser Gin Pro Leu Arg Gly His His Thr 
1605 1610 1615 



45 ACC TAT ACA GAA TGC TGC TGC CAA GAT GGG GAG GCC TGG AGC CAG CAA 4896 

Thr Tyr Thr Glu Cys Cys Cys Gin Asp Gly Glu Ala Trp Ser Gin Gin 
1620 1625 1630 

TGC GCT CTG TGC CCG CCC AGG AGC TCT GAG GTC TAC GCT CAG CTG TGC 4 944 

50 Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gin Leu Cys 
1635 1640 1645 

AAC GTG GCT CGG ATT GAG GCA GAG CGC GGA GCA GGG ATC CAC TTC CGG 4992 
Asn Val Ala Arg He Glu Ala Glu Arg Gly Ala Gly He His Phe Arg 
55 1650 1655 1660 

CCA GGC TAT GAG TAT GGC CCT GGC CTG GAC GAT CTG CCT GAA AAC CTC 5040 
Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu 
1665 1670 1675 1680 



TAC GGC CCA GAT GGG GCT CCC TTC TAT AAC TAC CTA GGC CCC GAG GAC 5088 
Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp 
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1685 1690 1695 

ACT GCC CCT GAG CCT CCC TTC TCC AAC CCA GCC AGC CAG CCG GGA GAC 
Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gin Pro Gly Asp 
1700 1705 1710 

AAC ACA CCT GTC CTT GAG CCT CCT CTG CAG CCC TCT GAA CTT CAG CCT 
Asn Thr Pro Val Leu Glu Pro Pro Leu Gin Pro Ser Glu Leu Gin Pro 
1715 1720 1725 

CAC TAT CTA GCC AGC CAC TCA GAA CCC CCT GCC TCC TTC GAA GGC CTT 
His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu 
1730 1735 1740 

CAG GCT GAG GAA TGT GGC ATC CTG AAT GGC TGT GAG AAT GGC CGC TGC 
Gin Ala Glu Glu Cys Gly He Leu Asn Gly Cys Glu Asn Gly Arg Cys 
1745 1750 1755 1760 

GTG CGT GTG CGG GAG GGC TAC ACT TGC GAC TGC TTT GAG GGC TTC CAG 
Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gin 
1765 1770 1775 

CTG GAT GCG CCC ACA TTG GCC TGT GTG GAT GTG AAC GAG TGT GAA GAC 
Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp 
1780 1785 1790 

TTG AAC GGG CCT GCA CGA CTC TGT GCA CAC GGT CAC TGT GAG AAC ACA 
Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr 
1795 1800 1805 

GAG GGT TCC TAT CGC TGC CAC TGT TCG CCA GGT TAC GTG GCA GAG CCA 
Glu Gly Ser Tyr Arg Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pro 
1810 1815 1820 

GGC CCC CCA CACJTGT GCG GCC AAG GAG 
Gly Pro Pro His Cys Ala Ala Lys Glu 
1825 1830 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gin Leu Cys Ser 
15 10 15 

His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys 
20 25 30 

He Gin Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val 
35 40 45 

Leu Met Gly Thr Ser His Ala Gin Arg Asp Ser He Gly Arg Tyr Glu 
50 55 60 
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Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His 
65 70 75 80 

Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp 
85 90 95 

Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gin Pro Ala Gin 
100 105 no 

Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg 
115 120 125 

Thr Gin Gin Leu Arg Arg Val Gin Pro Pro Val Gin Thr Arg Arg Ser 
130 135 140 

His Pro Arg Gly Gin Gin Gin He Ala Ala Arg Ala Ala Pro Ser Val 
145 150 155 160 

Ala Arg Leu Glu Thr Pro Gin Arg Pro Ala Ala Ala Arg Arg Gly Arg 
165 170 175 

Leu Thr Gly Arg Asn Val Cys Gly Gly Gin Cys Cys Pro Gly Trp Thr 
180 185 190 

Thr Ser Asn Ser Thr Asn His Cys He Lys Pro Val Cys Gin Pro Pro 
195 200 205 

Cys Gin Asn Arg Gly Ser Cys Ser Arg Pro Gin Val Cys He Cys Arg 
210 215 220 

Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val He Pro Glu Glu Glu 
225 230 235 240 

Phe Asp Pro Gin Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg 
_245 _ 250 255 

Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr 
260 265 270 

Arg He Gin Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg 
275 280 285 

Leu Ser Gin Pro Trp Pro Leu Gin Gin His Ser Gly Pro Ser Arg Thr 
290 295 300 

Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gin Leu Met Ser Asn 
305 310 315 320 

Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gin Ala Ala 
325 330 335 

His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys 
340 345 350 

He Lys Lys He Lys Val Val Phe Thr Pro Thr He Cys Lys Gin Thr 
355 360 365 

Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr 
370 375 380 

Thr Leu Tyr Ser Gin Gly Gly His Gly His Asp Pro Lys Ser Gly Phe 
385 390 395 400 
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Arg lie Tyr Phe Cys Gin lie Pro Cys Leu Asn Gly Gly Arg Cys lie 
405 410 415 

Gly Arg Asp Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys 
420 425 430 

His Leu Pro Val Pro Gin Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser 
435 440 445 

Arg His Arg Thr Leu Leu Glu Gly Pro Leu Lys Gin Ser Thr Phe Thr 
450 455 460 

Leu Pro Leu Ser Asn Gin Leu Ala Ser Val Asn Pro Ser Leu Val Lys 
465 470 475 480 

Val Gin He His His Pro Pro Glu Ala Ser Val Gin He His Gin Val 
485 490 495 

Ala Arg Val Arg Gly Glu Leu Asp Pro Val Leu Glu Asp Asn Ser Val 
500 505 510 

Glu Thr Arg Ala Ser His Arg Pro His Gly Asn Leu Gly His Ser Pro 
515 520 525 

Trp Ala Ser Asn Ser He Pro Ala Arg Ala Gly Glu Ala Pro Arg Pro 
530 535 540 

Pro Pro Val Leu Ser Arg His Tyr Gly Leu Leu Gly Gin Cys Tyr Leu 
545 550 555 560 

Ser Thr Val Asn Gly Gin Cys Ala Asn Pro Leu Gly Ser Leu Thr Ser 
565 570 575 

Gin Glu Asp Cys Cys Gly Ser Val Gly Thr Phe Trp Gly Val Thr Ser 
580_ _ 585 590 

Cys Ala Pro Cys Pro Pro Arg Gin Glu Gly Pro Ala Phe Pro Val He 
595 600 605 

Glu Asn Gly Gin Leu Glu Cys Pro Gin Gly Tyr Lys Arg Leu Asn Leu 
610 615 ' 620 

Ser His Cys Gin Asp He Asn Glu Cys Leu Thr Leu Gly Leu Cys Lys 
625 630 635 640 

Asp Ser Glu Cys Val Asn Thr Arg Gly Ser Tyr Leu Cys Thr Cys Arg 
645 650 655 

Pro Gly Leu Met Leu Asp Pro Ser Arg Ser Arg Cys Val Ser Asp Lys 
660 665 670 

Ala Val Ser Met Gin Gin Gly Leu Cys Tyr Arg Ser Leu Gly Ser Gly 
675 680 685 

Thr Cys Thr Leu Pro Leu Val His Arg He Thr Lys Gin He Cys Cys 
690 695 700 

Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gin Cys Pro 
705 710 715 720 

Leu Pro Gly Thr Glu Ala Phe Arg Glu He Cys Pro Ala Gly His Gly 
725 730 735 
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Tyr Thr Tyr Ser Ser Ser Asp lie Arg Leu Ser Met Arg Lys Ala Glu 
740 745 750 

Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gin Thr Glu Gin Ser Thr 
755 760 765 

Ala Pro Pro Pro Gly Gin Ala Glu Arg Gin Pro Leu Arg Ala Ala Thr 
770 775 780 

Ala Thr Trp lie Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg 
785 790 795 800 

Ala Val Gin lie Thr Thr Ser Ala Pro His Le x Pro Ala Arg Val Pro 
805 810 815 

Gly Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gin Gly lie 
820 825 830 

Pro Glu Ser Pro Ala Glu Glu Gin Val lie Pro Ser Ser Asp Val Leu 
835 840 845 

Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser 
850 855 860 

Asn lie Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg 
865 870 875 880 

Cys Val Cys Ser Pro Gly Tyr Gin Leu His Pro Ser Gin Asp Tyr Cys 
885 890 895 

Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg 
900 905 910 

Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr 
915 , _ _ ^920 925 

Thr Leu Val Thr Leu Gly Asp Thr Gin Glu Cys Gin Asp lie Asp Glu 
930 935 940 

Cys Glu Gin Pro Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu 
945 950 955 960 

Gly Ser Tyr His Cys Glu Cys Asp Arg Gly Tyr lie Met Val Arg Lys 
965 970 975 

Gly His Cys Gin Asp He Asn Glu Cys Arg His Pro Gly Thr Cys Pro 
980 985 990 

Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys 
995 1000 1005 

Glu Glu Gly Tyr Val Gly Gin Ser Gly Ser Cys Val Asp Val Asn Glu 
1010 1015 1020 

Cys Leu Thr Pro Gly He Cys Thr His Gly Arg Cys He Asn Met Glu 
1025 1030 1035 1040 

Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp 
1045 1050 1055 

Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys 
1060 1065 1070 
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Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala 
1075 1080 1085 

Cys Gin Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr Ala Cys Glu Asp 
1090 1095 1100 

Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr 
1105 1110 1115 1120 

Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gin Gly Tyr Arg 
1125 1130 1135 

Pro Asn Pro " 3U Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly 
1140 1145 1150 

Pro Gin Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser 
1155 1160 1165 

Tyr Gin Cys Leu Cys His Gin Gly Phe Gin Leu Val Asn Gly Thr Met 
1170 1175 1180 

Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His Cys Ala Pro His 
1185 1190 1195 1200 

Gly Glu Cys Leu Asn Ser Leu Gly Ser Phe Phe Cys Leu Cys Ala Pro 
1205 1210 1215 

Gly Phe Ala Ser Ala Glu Gly Gly Thr Arg Cys Gin Asp Val Asp Glu 
1220 1225 1230 

Cys Ala Ala Thr Asp Pro Cys Pro Gly Gly His Cys Val Asn Thr Glu 
1235 1240 1245 

Gly Ser Phe Ser Cys Leu Cys Glu Thr Ala Ser Phe Gin Pro Ser Pro 
1250 _ ^ _ 1?55_ . 1260 

Asp Ser Gly Glu Cys Leu Asp lie Asp Glu Cys Glu Asp Arg Glu Asp 
1265 1270 1275 1280 

Pro Val Cys Gly Ala Trp Arg Cys Glu Asn Ser Pro Gly Ser Tyr Arg 
1285 1290 1295 

Cys lie Leu Asp Cys Gin Pro Gly Phe Tyr Val Ala Pro Asn Gly Asp 
1300 1305 1310 

Cys lie Asp lie Asp Glu Cys Ala Asn Asp Thr Val Cys Gly Asn His 
1315 1320 1325 

Gly Phe Cys Asp Asn Thr Asp Gly Ser Phe Arg Cys Leu Cys Asp Gin 
1330 1335 1340 

Gly Phe Glu Thr Ser Pro Ser Gly Trp Glu Cys Val Asp Val Asn Glu 
1345 1350 1355 1360 

Cys Glu Leu Met Met Ala Val Cys Gly Asp Ala Leu Cys Glu Asn Val 
1365 1370 1375 

Glu Gly Ser Phe Leu Cys Leu Cys Ala Ser Asp Leu Glu Glu Tyr Asp 
1380 1385 1390 

Ala Glu Glu Gly His Cys Arg Pro Arg Val Ala Gly Ala Gin Arg lie 
1395 1400 1405 
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Pro Glu Val Arg Thr Glu Asp Gin Ala Pro Ser Leu lie Arg Met Glu 
1410 1415 1420 

Cys Tyr Ser Glu Kis Asn Gly Gly Pro Pro Cys Ser Gin lie Leu Gly 
1425 1430 1435 1440 

Gin Asn Ser Thr Gin Ala Glu Cys Cys Cys Thr Gin Gly Ala Arg Trp 
1445 1450 1455 

Gly Lys Ala Cys Ala Pro Cys Pro Ser Glu Asp Ser Val Glu Phe Ser 
1460 1465 1470 

Gin Leu Cys Pro Ser Gly Gin Gly Tyr He Pro Val Glu Gly Ala Trp 
:.475 1480 1485 

Thr Phe Gly Gin Thr Met Tyr Thr Asp Ala Asp Glu Cys Val Leu Phe 
1490 1495 1500 

Gly Pro Ala Leu Cys Gin Asn Gly Arg Cys Ser Asn He Val Pro Gly 
1505 1510 1515 1520 

Tyr lie Cys Leu Cys Asn Pro Gly Tyr His Tyr Asp Ala Ser Ser Arg 
1525 1530 1535 

Lys Cys Gin Asp His Asn Glu Cys Gin Asp Leu Ala Cys Glu Asn Gly 
1540 1545 1550 

Glu Cys Val Asn Gin Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro 
1555 1560 1565 

Leu Thr Leu Asp Leu Ser Gly Gin Arg Cys Val Asn Thr Thr Ser Ser 
1570 1575 1580 

Thr Glu Asp Phe Pro Asp His Asp He His Met Asp He Cys Trp Lys 
1585 1590 1595 1600 

Lys Val Thr Asn Asp Val Cys Ser Gin Pro Leu Arg Gly His His Thr 
1605 1610 1615 

Thr Tyr Thr Glu Cys Cys Cys Gin Asp Gly Glu Ala Trp Ser Gin Gin 
1620 1625 1630 

Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gin Leu Cys 
1635 1640 1645 

Asn Val Ala Arg He Glu Ala Glu Arg Gly Ala Gly He His Phe Arg 
1650 1655 1660 

Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu 
1665 1670 1675 1680 

Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp 
1685 1690 1695 

Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gin Pro Gly Asp 
1700 1705 1710 

Asn Thr Pro Val Leu Glu Pro Pro Leu Gin Pro Ser Glu Leu Gin Pro 
1715 1720 1725 

His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu 
1730 1735 1740 
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Gin Ala Glu Glu Cys Gly lie Leu Asn Gly Cys Glu Asn Gly Arg Cys 
1745 1750 1755 1760 



Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gin 
5 1765 1770 1775 

Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp 
1780 1785 1790 

10 Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr 
1795 1800 1805 

Glu G 1 y Ser Tyr Arg Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pre 
1810 1815 1820 

15 

Gly Pro Pro His Cys Ala Ala Lys Glu 
1825 1830 



20 (2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 753 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



HI (ii) MOLECULE TYPE: other nucleic acid 

,H (A) DESCRIPTION: /desc = "DNA" 

SJ30 

IJf (ix) FEATURE: 

IP (A) NAME / KEY : CDS 

fQ (B) LOCATION: 1..3753 

^3$5 (xi) ^SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

£3 ATG CGC CAG GCC GCA TTG GGG CTG CTG GCA CTA CTC CTG CTG GCG CTG 4 8 

fir! Met Arg Gin Ala Ala Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala Leu 
15 10 15 



60 



CTG GGC CCC GGC GGC CGA GGG GTG GGC CGG CCG GGC AGC GGG GCA CAG 96 
Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala Gin 
20 25 30 



45 GCG GGG GCG GGG CGC TGG GCC CAA CGC TTC AAG GTG GTC TTT GCG CCT 144 

Ala Gly Ala Gly Arg Trp Ala Gin Arg Phe Lys Val Val Phe Ala Pro 
35 40 45 

GTG ATC TGC AAG CGG ACC TGT CTG AAG GGC CAG TGT CGG GAC AGC TGT 192 
50 Val lie Cys Lys Arg Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser Cys 
50 55 60 

CAG CAG GGC TCC AAC ATG ACG CTC ATC GGA GAG AAC GGC CAC AGC ACC 24 0 

Gin Gin Gly Ser Asn Met Thr Leu lie Gly Glu Asn Gly His Ser Thr 
55 65 70 75 80 

GAC ACG CTC ACC GGT TCT GCC TTC CGC GTG GTG GTG TGC CCT CTA CCC 288 
Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu Pro 
85 90 95 



TGC ATG AAC GGT GGC CAG TGC TCT TCC CGA AAC CAG TGC CTG TGT CCC 33 6 

Cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys Pro 
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100 



105 



110 



5 


CCG 
Pro 


GAT 
Asp 


TTC 
Phe 
115 


ACG 
Thr 


GGG 
Gly 


CGC 
Arg 


TTC 
Phe 


TGC 
Cys 
120 


CAG 
Gin 


GTG 
Val 


CCT 
Pro 


GCT 
Ala 


GCA GGA 
Ala Gly 
125 


ACC 
Thr 


GGA 
Gly 


384 


10 


GCT 
Ala 


GGC 
Gly 
130 


ACC 
Thr 


GGG 
Gly 


AGT 
Ser 


TCA 
Ser 


GGC 
Gly 
135 


CCC 
Pro 


GGC 
Gly 


TGG 
Trp 


CCC 
Pro 


GAC 
Asp 
140 


CGG GCC 
Arg Ala 


ATG 
Met 


TCC 
Ser 


432 




ACA 
Thr 
145 


GGC 
Gly 


CCG 
Pro 


CTG 
Leu 


CCG 
Pro 


CCC 
Pro 
150 


CTT 
Leu 


GCC 
Ala 


CCA 
Pro 


GAA 
Glu 


GGA GAG 
Gly Glu 
155 


TCT 
Ser 


GTG 
Val 


GCT 
Ala 


AGC 
Ser 
160 


480 


15 


AAA 
Lys 


CAC 
His 


GCC 
Ala 


ATT 
He 


TAC 
Tyr 
165 


GCG 
Ala 


GTG 
Val 


CAG 
Gin 


GTG 
Val 


ATC 
He 
170 


GCA 
Ala 


GAT 
Asp 


CCT 
Pro 


CCC 
Pro 


GGG 
Gly 
175 


CCG 
Pro 


528 


20 


GGG 
Gly 


GAG 
Glu 


GGT 
Gly 


CCT 
Pro 

i on 
Xo u 


CCT 
Pro 


GCA 
Ala 


CAA 
Gin 


CAT 
His 


GCA 
Ala 

IOC 

lob 


GCC 
Ala 


TTC 
Phe 


TTG 
Leu 


GTG 
Val 


CCC 
Pro 
190 


CTG 
Leu 


GGG 
Gly 


576 


A? 


CCA 
Pro 


GGA 
Gly 


CAA 
Gin 
195 


ATC 
He 


TCG 
Ser 


GCA 
Ala 


GAA 
Glu 


GTG 
Val 
200 


CAG 
Gin 


GCT 
Ala 


CCG 
Pro 


CCC 
Pro 


CCC 
Pro 
205 


GTG 
Val 


GTG 
Val 


AAC 
Asn 


624 


■ 

m 


GTG 
Val 


CGT 
Arg 
210 


GTC 
Val 


CAT 
His 


CAC 
His 


CCT 
Pro 


CCT 
Pro 
215 


GAA 
Glu 


GCT 
Ala 


TCC 
Ser 


GTT 
Val 


CAG 
Gin 


GTG 
Val 


CAC 
His 


CGC 
Arg 


ATC 
He 


672 


flJ 


GAG 
Glu 
225 


GGG 
Gly 


CCG 
Pro 


AAC 
Asn 


GCT 
Ala 


GAA 
Glu 
230 


GGC 
Gly 


CCA 
Pro 


GCC 
Ala 


TCT 
Ser 


TCC 
Ser 
235 


CAG 
Gin 


CAC 
His 


TTG 
Leu 


CTG 
Leu 


CCG 
Pro 
240 


720 




CAT 
His 


CCC 
Pro 


AAG 
Lys 


CCC 
Pro 


jCCG 
Pro 
245 


CAC 
His 


Pro"" 


j^GG 
Arg 


CCA 
Pro 


CCC 
Pro 
250 


ACT 
Thr 


CAA 
Gin 


AAG 
Lys 


CCA 
Pro 


CTG 
Leu 
255 


GGC 
Gly 


768 


Mo 

if 8 ? ; 

t y> 


CGC 
Arg 


TGC 
Cys 


TTC 
Phe 


CAG 
Gin 
260 


GAC 
Asp 


ACA 
Thr 


TTG 
Leu 


CCC 
Pro 


AAG 
Lys 
265 


CAG 
Gin 


CCT 
Pro 


TGT 
Cys 


GGC 
Gly 


AGC 
Ser 
270 


AAC 
Asn 


CCT 
Pro 


816 


45 


TTG 
Leu 


CCT 
Pro 


GGC 
Gly 
275 


CTT 
Leu 


ACC 
Thr 


AAG 
Lys 


CAG 
Gin 


GAA 
Glu 
280 


GAT 
Asp 


TGC 
Cys 


TGC 
Cys 


GGT 
Gly 


AGC 
Ser 
285 


ATC 
He 


GGT 
Gly 


ACT 
Thr 


864 


50 


GCC 
Ala 


TGG 
Trp 
290 


GGA 
Gly 


CAA 
Gin 


AGC 
Ser 


AAG 
Lys 


TGT 
Cys 
295 


CAC 
His 


AAG 
Lys 


TGC 
Cys 


CCA 
Pro 


CAG 
Gin 
300 


CTT 
Leu 


CAG 
Gin 


TAT 
Tyr 


ACA 
Thr 


912 




GGG 
Gly 
305 


GTG 
Val 


CAG 
Gin 


AAG 
Lys 


CCT 
Pro 


GTA 
Val 
310 


CCT 
Pro 


GTA 
Val 


CGT 
Arg 


Gly 


GAG 
Glu 
315 


GTG 
Val 


GGT GCT 
Gly Ala 


Asp 


rp/-l /-I 

1 bL 

Cys 
320 


y 6 u 


55 


CCC 
Pro 


CAG 
Gin 


GGC 
Gly 


TAC 
Tyr 


AAG 
Lys 
325 


AGG 
Arg 


CTC 
Leu 


AAC 
Asn 


AGC 
Ser 


ACC 
Thr 
330 


CAC 
His 


TGC 
Cys 


CAG 
Gin 


GAT 
Asp 


ATC 
He 
335 


AAC 
Asn 


1008 


60 


GAA 
Glu 


TGT 
Cys 


GCG 
Ala 


ATG 
Met 


CCC 
Pro 


GGG 
Gly 


AAT 
Asn 


GTG 
Val 


TGC 
Cys 


CAT 
His 


GGT GAC 
Gly Asp 


TGC 
Cys 


CTC 
Leu 


AAC 
Asn 


AAC 
Asn 


1056 



340 345 350 



-107- 



CCT GGC TCT TAT CGC TGT GTC TGC CCG CCC GGT CAT AGC TTG GGT CCC 1104 
Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly Pro 
355 360 365 

5 CTC GCA GCA CAG TGC ATT GCC GAC AAA CCA GAG GAG AAG AGC CTG TGT 1152 

Leu Ala Ala Gin Cys lie Ala Asp Lys Pro Glu Glu Lys Ser Leu Cys 
370 375 380 

TTC CGC CTT GTG AGC ACC GAA CAC CAG TGC CAG CAC CCT CTG ACC ACA 1200 
10 Phe Arg Leu Val Ser Thr Glu His Gin Cys Gin His Pro Leu Thr Thr 

385 390 395 400 



15 



20 



CGC CTA ACC CGC CAG CTC TGC TGC TGT AGT GTG GGT AAA GCC TGG GGT 1248 
Arg Leu Thr Arg Gin Leu Cys Cys Cys Ser Val Gly Lys Ala Trp Gly 
405 410 415 

GCC CGG TGC CAG CGC TGC CCG GCA GAT GGT ACA GCA GCC TTC AAG GAG 1296 
Ala Arg Cys Gin Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys Glu 
420 425 430 

ATC TGC CCC GGC TGG GAA AGG GTA CCA TAT CCT CAC CTC CCA CCA GAC 1344 
lie Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro Asp 
435 440 445 

GCT CAC CAT CCA GGG GGA AAG CGA CTT CTC CCT CTT CCT GCA CCC GAC 13 92 

% Ala His His Pro Gly Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro Asp 
^ 450 455 460 

tfi GGG CCA CCC AAA CCC CAG CAG CTT CCT GAA AGC CCC AGC CGA GCA CCA 1440 

|f| Gly Pro Pro Lys Pro Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala Pro 
465 470 475 480 

|D CCC CTC GAG GAC ACA GAG GAA GAG AGA GGA GTG ACC ATG GAT CCA CCA 14 88 

m Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro Pro 

35 ^ ^485 _ _ _ 490 495 

0 GTG AGT GAG GAG CGA TCG GTG CAG CAG AGC CAC CCC ACT ACC ACC ACC 1536 

||i Val Ser Glu Glu Arg Ser Val Gin Gin Ser His Pro Thr Thr Thr Thr 

y. 500 505 510 

S^. TCA CCC CCC CGG CCT TAC CCA GAG CTC ATC TCT CGC CCC TCC CCA CCT 1584 

|5 Ser Pro Pro Arg Pro Tyr Pro Glu Leu lie Ser Arg Pro Ser Pro Pro 
O 515 52 0 525 

45 ACC TTC CAC CGG TTC CTG CCA GAC TTG CCC CCA TCC CGA AGT GCA GTG 1632 

Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala Val 

530 535 540 

GAG ATC GCC CCC ACT CAG GTC ACA GAG ACC GAT GAG TGC CGA TTG AAC 1680 
50 Glu lie Ala Pro Thr Gin Val Thr Glu Thr Asp Glu Cys Arg Leu Asn 

545 550 555 560 

CAG AAT ATC TGT GGC CAT GGA CAG TGT GTG CCT GGC CCC TCG GAT TAC 1728 
Gin Asn lie Cys Gly His Gly Gin Cys Val Pro Gly Pro Ser Asp Tyr 
55 565 570 575 

TCC TGC CAC TGC AAC GCT GGC TAC CGG TCA CAC CCG CAG CAC CGC TAC 1776 
Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gin His Arg Tyr 
580 585 590 



60 



TGT GTT GAT GTG AAC GAG TGC GAG GCA GAG CCC TGC GGC CCC GGG AAA 1824 
Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly Lys 
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595 



600 



605 



GGC ATC TGT ATG AAC ACT GGT GGC TCC TAC AAT TGT CAC TGC AAC CGA 1872 
Gly lie Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn Arg 
610 615 620 

GGC TAC CGC CTC CAC GTG GGT GCA GGG GGC CGC TCG TGC GTG GAC CTG 1920 
Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp Leu 
625 630 635 640 

AAC GAG TGC GCC AAG CCT CAC CTG TGT GGG GAC GGT GGC TTC TGC ATC 1968 
Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys He 
645 650 655 

15 AAC TTC CCT GGT CAC TAC AAA TGC AAC TGC TAT CCT GGC TAC CGG CTC 2 016 

Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu 
660 665 670 



10 



20 



MtO 



50 



60 



AAG GCC TCC CGA CCG CCC ATT TGC GAA GAC ATC GAC GAG TGT CGC GAC 2 064 

Lys Ala Ser Arg Pro Pro He Cys Glu Asp He Asp Glu Cys Arg Asp 
675 680 685 



CCT AGC ACC TGC CCT GAT GGC AAA TGT GAA AAC AAA CCT GGC AGC TTC 2112 

Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser Phe 
f& P 690 695 700 

yQ AAG TGC ATC GCC TGC CAG CCT GGC TAC CGT AGC CAG GGG GGC GGG GCC 216 0 

Ifl L Y S C Y S Ile Ala c ys Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly Ala 

:H A 705 710 715 720 

Ill TGT CGT GAT GTC AAC GAA TGC TCC GAA GGT ACC CCC TGC TCT CCT GGA 2208 

Ell Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro Gly 

IS 725 730 735 



TGG TGT GAG AAAJTTT CCG GGTJTCT TAC CGT TGC ACG TGT GCC CAG GGG 2256 

Trp Cys Glu Lys Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin Gly 
740 745 750 

ATA CGA ACC CGC ACA GGA CGC CTC AGT TGC ATA GAC GTG GAT GAC TGT 23 04 

He Arg Thr Arg Thr Gly Arg Leu Ser Cys lie Asp Val Asp Asp Cys 
755 760 765 



^ GAG GCT GGG AAA GTG TGC CAA GAT GGC ATC TGC ACG AAC ACA CCA GGC 2352 

Glu Ala Gly Lys Val Cys Gin Asp Gly Ile Cys Thr Asn Thr Pro Gly 
45 770 775 780 



TCT TTC CAG TGT CAG TGC CTC TCC GGC TAT CAT CTG TCA AGG GAT CGG 2400 
Ser Phe Gin Cys Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp Arg 
7 ^5 790 795 800 

AGC CGC TGT GAG GAC ATT GAT GAA TGT GAC TTC CCT GCG GCC TGC ATC 2448 
Ser Arg Cys Glu Asp Ile Asp Glu Cys Asp Phe Pro Ala Ala Cys Ile 
805 810 815 



55 GGG GGT GAC TGC ATC AAT ACC AAT GGT TCC TAC AGA TGT CTC TGT CCC 

Gly Gly Asp Cys Ile Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys Pro 
820 825 830 



2496 



CTG GGT CAT CGG TTG GTG GGC GGC AGG AAG TGC AAG AAA GAT ATA GAT 2544 
Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp He Asp 
835 840 845 
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10 



GAG TGC TAC CTT AAC TTC GAT GAC ACA GTG TTC TGT GAC AGC GTA TTG 
Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val Leu 
15 900 — - 



20 



io 



45 



50 



60 



905 910 



GAG GAG AGG ACG GCA TGT GTA TGG GGC CCC TGG GCG GGA CCT GCC CTC 
Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro Ala Leu 
1075 1080 1085 

ACT TTT GAT GAC TGC TGC TGC CGC CAG CCG CGG CTG GGT ACC CAG TGC 
Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr Gin Cys 
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2592 



2640 



GAG TGC AGC CAG GAC CCA GGC CTG TGC CTG CCC CAT GCC TGC GAG AAC 
Glu Cys Ser Gin Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu Asn 
850 855 860 

CTC CAG GGC TCC TAT GTC TGT GTC TGT GAT GAG GGT TTC ACA CTC ACC 
Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu Thr 
865 870 875 880 

CAG GAC CAG CAT GGG TGT GAG GAG GTG GAG CAG CCC CAC CAC AAG AAG 2688 
Gin Asp Gin His Gly Cys Glu Glu Val Glu Gin Pro His His Lys Lys 
885 890 895 
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GCT ACC AAT GTC ACT CAG CAG GAA TGC TGT TGC TCT CTG GGA GCT GGC 
Ala Thr Asn Val Thr Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala Gly 
915 920 925 

TGG GGA GAC CAC TGC GAA ATC TAT CCC TGT CCA GTC TAC AGC TCA GCC 2832 
Trp Gly Asp His Cys Glu He Tyr Pro Cys Pro Val Tyr Ser Ser Ala 
930 935 940 

r |5 GAA TTT CAC AGC CTG GTG CCT GAT GGG AAA AGG CTA CAC TCA GGA CAA 

*£ Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly Gin 

m 945 950 955 960 

CAA CAT TGT GAA CTA TGC ATT CCT GCC CAC CGT GAC ATC GAC GAA TGC 
Gin His Cys Glu Leu Cys He Pro Ala His Arg Asp He Asp Glu Cys 
965 970 975 

IB ATA TTG TTT GGG GCA GAG ATC TGC AAG GAG GGC AAG TGT GTG AAC TCG 2 976 

He Leu Phe Gly Ala Glu He Cys Lys Glu Gly Lys Cys Val Asn Ser 
*®J _ 980 _ 985 990 



2880 



2928 



CAG CCC GGC TAC GAG TGC TAC TGC AAG CAG GGC TTC TAC TAC GAT GGC 3 024 

Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp Gly 
995 1000 1005 

AAC CTG CTG GAG TGC GTG GAC GTG GAC GAG TGC TTG GAT GAG TCT AAC 3072 
Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn 
1010 1015 1020 

TGC AGG AAC GGA GTG TGT GAG AAC ACG TGG CGG CTA CCG TGT GCC TGC 3120 
Cys Arg Asn Gly Val Cys Glu Asn Thr Trp Arg Leu Pro Cys Ala Cys 
102 5 1030 1035 1040 

ACT CCG CCG GCA GAG TAC AGT CCC GCA CAG GCC CAG TGT CTG ATC CCG 3168 
Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu He Pro 
1045 1050 1055 

GAG AGA TGG AGC ACG CCC CAG AGA GAC GTG AAG TGT GCT GGG GCC AGC 3216 
Glu Arg Trp Ser Thr Pro Gin Arg Asp Val Lys Cys Ala Gly Ala Ser 
^ 1060 1065 1070 



3264 



3312 



0 



1090 1095 1100 

AGA CCG TGC CCG CCA CGT GGC ACC GGG TCC CAG TGC CCG ACT TCA CAG 3360 
Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin 
,1105 1110 1115 1120 

AGT GAG AGC AAT TCT TTC TGG GAC ACA AGC CCC CTG CTA CTG GGG AAG 3408 
Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys 
1125 1130 1135 

TCT CCG CGA GAC GAA GAC AGC TCA GAG GAG GAT TCA GAT GAG TGC CGT 3456 
Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg 
1140 1145 1150 

15 TGT GTG AGC GGA CCG TGT GTG CCA CGG CCA GGC GGG GCG GTA TGC GAG 3504 

Cys Val Ser Gly Pro Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu 
1155 1160 1165 

TGT CCT GGA GGC TTT CAG CTG GAC GCC TCC CGT GCC CGC TGC GTG GAC 3 552 

Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp 
1170 1175 1180 

ATT GAT GAG TGC CGA GAA CTG AAC CAG CGG GGA CTG CTG TGT AAG AGC 3600 
He Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser 
*P ll 85 1190 1195 1200 

Q GAG CGG TGC GTG AAC ACC AGT GGA TCC TTC CGC TGT GTC TGC AAA GCT 364 8 

0 Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala 

1205 1210 1215 

GGC TTC ACG CGC AGC CGC CCT CAC GGG CCT GCG TGC CTC AGC GCC GCC 3696 
Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala 
1220 1225 1230 



20 



Bo 



*S5 GCT GAT GAT J3CAJ3CC ATA GCC CAC ACC TCA GTG ATC GAT CAT CGA GGG 

* Ala Asp Asp Ala Ala He Ala His Thr Ser Val He Asp His Arg Gly 

D 1235 1240 1245 



45 



50 



55 



60 



3744 



TAT TTT CAC 3753 
Tyr Phe His 
1250 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Arg Gin Ala Ala Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala Leu 
1 5 10 15 

Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala Gin 
20 25 30 

Ala Gly Ala Gly Arg Trp Ala Gin Arg Phe Lys Val Val Phe Ala Pro 
35 40 45 
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Val lie Cys Lys Arg Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser Cys 
50 55 60 

Gin Gin Gly Ser Asn Met Thr Leu lie Gly Glu Asn Gly His Ser Thr 
65 70 75 80 

Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu Pro 
85 90 95 

Cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys Pro 
100 105 110 

Pro Aop Fhe r "ir Gly Arg Phe Cys Gin Val Pr:> Ala Ala Gly Thr Gly 
115 120 125 

Ala Gly Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met Ser 
130 135 140 

Thr Gly Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala Ser 
145 150 155 160 

Lys His Ala lie Tyr Ala Val Gin Val lie Ala Asp Pro Pro Gly Pro 
165 170 175 

Gly Glu Gly Pro Pro Ala Gin His Ala Ala Phe Leu Val Pro Leu Gly 
180 185 190 

Pro Gly Gin He Ser Ala Glu Val Gin Ala Pro Pro Pro Val Val Asn 
195 200 205 

Val Arg Val His His Pro Pro Glu Ala Ser Val Gin Val His Arg He 
210 215 220 

Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gin His Leu Leu Pro 
225 _ 230 235 240 

His Pro Lys Pro Pro His Pro Arg Pro Pro Thr Gin Lys Pro Leu Gly 
245 250 255 

Arg Cys Phe Gin Asp Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn Pro 
260 265 270 

Leu Pro Gly Leu Thr Lys Gin Glu Asp Cys Cys Gly Ser He Gly Thr 
275 280 285 

Ala Trp Gly Gin Ser Lys Cys His Lys Cys Pro Gin Leu Gin Tyr Thr 
290 295 300 

Gly Val Gin Lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp Cys 
305 310 315 320 

Pro Gin Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gin Asp He Asn 
325 330 335 

Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn Asn 
340 345 350 

Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly Pro 
355 360 365 

Leu Ala Ala Gin Cys He Ala Asp Lys Pro Glu Glu Lys Ser Leu Cys 
370 375 380 
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Phe Arg Leu Val Ser Thr Glu His Gin Cys Gin His Pro Leu Thr Thr 
385 390 395 400 

Arg Leu Thr Arg Gin Leu Cys Cys Cys Ser Val Gly Lys Ala Trp Gly 
405 410 415 

Ala Arg Cys Gin Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys Glu 
420 425 430 

lie Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro Asp 
435 440 445 

Ala His His Pro Gly Gly Lvs Arg Leu Leu Pro Leu Pro Ala Pro Asp 
450 455 460 

Gly Pro Pro Lys Pro Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala Pro 
4 65 470 475 480 

Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro Pro 
485 490 495 

Val Ser Glu Glu Arg Ser Val Gin Gin Ser His Pro Thr Thr Thr Thr 
500 505 510 

Ser Pro Pro Arg Pro Tyr Pro Glu Leu He Ser Arg Pro Ser Pro Pro 
515 520 525 

Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala Val 
530 535 540 

Glu He Ala Pro Thr Gin Val Thr Glu Thr Asp Glu Cys Arg Leu Asn 
54 5 550 555 560 

Gin Asn He Cys Gly His Gly Gin Cys Val Pro Gly Pro Ser Asp Tyr 
J565 _ 570 575 

Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gin His Arg Tyr 
580 585 590 

Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly Lys 
595 600 605 

Gly He Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn Arg 
610 615 620 

Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp Leu 
625 630 635 640 

Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys He 
645 650 655 

Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu 
660 665 670 

Lys Ala Ser Arg Pro Pro He Cys Glu Asp He Asp Glu Cys Arg Asp 
675 680 685 

Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser Phe 
690 695 700 

Lys Cys He Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly Ala 
705 710 715 720 
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Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro Gly 
725 730 735 

Trp Cys Glu Lys Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin Gly 
740 745 750 

lie Arg Thr Arg Thr Gly Arg Leu Ser Cys He Asp Val Asp Asp Cys 
755 760 765 

Glu Ala Gly Lys Val Cys Gin Asp Gly He Cys Thr Asn Thr "Pro Gly 
770 775 780 

Ser Phe Gin Cyr Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp Arg 
785 790 795 800 

Ser Arg Cys Glu Asp He Asp Glu Cys Asp Phe Pro Ala Ala Cys lie 
805 810 815 

Gly Gly Asp Cys He Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys Pro 
820 825 830 

Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp He Asp 
835 840 845 

Glu Cys Ser Gin Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu Asn 
850 855 860 

Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu Thr 
8^5 870 875 880 

Gin Asp Gin His Gly Cys Glu Glu Val Glu Gin Pro His His Lys Lys 
885 890 895 

Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val Leu 
900_ ^ 905 910 

Ala Thr Asn Val Thr Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala Gly 
915 920 925 

Trp Gly Asp His Cys Glu He Tyr Pro Cys Pro Val Tyr Ser Ser Ala 
930 935 940 

Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly Gin 
945 950 955 960 

Gin His Cys Glu Leu Cys He Pro Ala His Arg Asp He Asp Glu Cys 
965 970 975 

He Leu Phe Gly Ala Glu He Cys Lys Glu Gly Lys Cys Val Asn Ser 
980 985 990 

Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp Gly 
995 1000 1005 

Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn 
1010 1015 1020 

Cys Arg Asn Gly Val Cys Glu Asn Thr Trp Arg Leu Pro Cys Ala Cys 
1025 1030 1035 1040 

Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu He Pro 
1045 1050 1055 
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Glu Arg Trp Ser Thr Pro Gin Arg Asp Val Lys Cys Ala Gly Ala Ser 
1060 1065 1070 

Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro Ala Leu 
1075 1080 1085 

Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr Gin Cys 
1090 1095 1100 

Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin 
1105 1110 1115 1120 

Ser Glu Ser As^ Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys 
1125 1130 1135 

Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg 
1140 1145 1150 

Cys Val Ser Gly Pro Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu 
1155 1160 1165 

Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp 
1170 1175 1180 

lie Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser 
1185 1190 1195 1200 

Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala 
1205 1210 1215 

Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala 
1220 1225 1230 

Ala Asp Asp Ala Ala lie Ala His Thr Ser Val lie Asp His Arg Gly 
x ? 35 ^ _ . _1? 4 .° 1245 

Tyr Phe His 
1250 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Gly Glu Ser Val Ala Ser Lys His Ala lie Tyr Ala Val Cys 
15 10 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

TACCGATGCT ACCGCAGCAA TCTT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 

ATGCCTAAAC TCTACCAGCA CG 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION! 73esc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 

GAGTCACGTC ATCCATTCCA CA 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

CGTCCAAGTT GTGTCTTAGC AG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 
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<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Pro Pro Gly Pro Gin Gly Ala Thr Gly Pro Leu Gly Pro Lys Gly 
15 10 15 

Gin Thr Gly Glu Pro Gly lie Ala Gly Phe Lys Gly Glu Gin Gly Pro 
20 25 30 

Lys Gly Glu Thr Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Pro Ala 
35 40 45 

Gly Glu Glu Gly Lys 
50 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = 11 DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCCCTCCCG GTCCTCAAGG TGCAACTGGT CCTCTGGGCC CCAAAGGTCA GACGGGTGAG 

CCCGGCATCG CTGGCTTCAA AGGTGAACAA GGCCCCAAGG GAGAGACTGG ACCTGCTGGG 

CCCCAGGGAG CCCCTGGCCC TGCTGGTGAA GAAGGAAAA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAACGTCACA CGTGANACGT GAACGTTGCT TGCTGG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TTACGTCCAC GTACACGTCT AGCAAGCAAG CA 
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WHAT IS CLAIMED IS: 



1. A DNA segment comprising an isolated gene that encodes an LTBP-2 or LTBP-3 
protein. 



2. The DNA segment of claim 1, comprising an isolated gene that encodes an LTBP-2 
protein. 



3. The DNA segment of claim 2, comprising an LTBP-2 gene-that encodes an LTBP-2 
protein or peptide that includes a contiguous amino acid sequence..from SEQ ID NO:2. 

4. The DNA segment of claim 3, comprising an LTBP-2 gene that includes a contiguous 
nucleic acid sequence from between position 1 and position 5499 of SEQ ID NO:l. 

5. The DNA segment of claim 3, comprising an LTBP-2 gene that encodes an LTBP-2 
protein of about 1833 amino acids in length. 

6. The DNA segment of claim 5, comprising an LTBP-2 gene that includes the 
contiguous nucleic acid sequence from position 1 to position 5499 of SEQ ID NO:l. 

7. The DNA segment of claim 1, comprising an isolated gene that encodes an LTBP-3 
protein. 
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8. The DNA segment of claim 7, comprising an LTBP-3 gene that encodes an LTBP-3 
protein or peptide that includes a contiguous amino acid sequence from SEQ ID NO:4. 

9. The DNA segment of claim 8, comprising an LTBP-3 gene that includes a contiguous 
nucleic acid sequence from between position 1 and position 3753 of SEQ ID NO: 3. 

10. The DNA segment of claim 8, comprising an LTBP-3 gene that encodes an LTBP-3 
protein of about 1251 amino acids in length. 

11. The DNA segment of claim 10, comprising an LTBP-3 gene that includes the 
contiguous nucleic acid sequence from position 1 to position 3753 of SEQ ID NO: 3. 

12. The DNA segment of claim 1, positioned under the control of a promoter. 

13. The DNA segment of claim 12, positioned under the control of a recombinant 
promoter. 

14. The DNA segment of claim 12, wherein the isolated gene is positioned, in reverse 
orientation, under the control of a promoter that directs the expression of an antisense 
product. 

15. The DNA segment of claim 13, further defined as a recombinant vector. 
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16. A recombinant host cell comprising a DNA segment that comprises an isolated gene 
that encodes an LTBP-2 or LTBP-3 protein or peptide. 

17. The recombinant host cell of claim 15, further defined as a prokaryotic host cell. 

18. The recombinant host cell of claim 15 , further defined as a eukaryotic host cell. 

19. The recombinant host cell of claim 15, wherein the DNA segment is introduced into 
the cell by means of a recombinant vector and the host cell expresses the DNA segment to 
produce the encoded LTBP-2 or LTBP-3 protein or peptide. 

20. The recombinant host cell of claim 18, wherein the expressed LTBP-2 or LTBP-3 
protein or peptide includes a contiguous amino acid sequence from SEQ ID NO: 2 or SEQ ID 
NO:4. 

21. A method of using a DNA segment that includes an isolated gene that encodes an 
LTBP-2 or LTBP-3 protein or peptide, comprising the steps of: 

(a) preparing a recombinant vector in which an LTBP-2- or LTBP-3- 
encoding DNA segment is positioned under the control of a promoter; 

(b) introducing said recombinant vector into a recombinant host cell; 

(c) culturing the recombinant host cell under conditions effective to allow 
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expression of an encoded LTBP-2 or LTBP-3 protein or peptide; and 
(d) collecting said expressed LTBP-2 or LTBP-3 protein or peptide. 

5 

22. A recombinant LTBP-2 or LTBP-3 protein or peptide prepared by expressing an LTBP-2- 
or LTBP-3-encoding DNA segment in a recombinant host cell and purify iag the expressed LTBP-2- 
or LTBP-3 protein or peptide away from total recombinant host cell components. 

23. A method for detecting an LTBP-2 or LTBP-3 nucleic acid segment in a sample, 
comprising the steps of: 

(a) obtaining sample nucleic acids from a sample suspected of containing 
an LTBP-2 or LTBP-3 nucleic acid segment; 

(b) contacting said sample nucleic acids with an isolated LTBP-2 or 
LTBP-3 nucleic acid segment under conditions effective to allow 
hybridization of substantially complementary nucleic acids; and 

(c) detecting the hybridized complementary nucleic acids thus formed. 

24. A nucleic acid segment characterized as: 
25 

(a) a nucleic acid segment comprising a sequence region that consists of at 
least 17 contiguous nucleotides that have the same sequence as, or are 
complementary to, 17 contiguous nucleotides of SEQ ID NO:l or SEQ 
ID NO:3; or 

30 
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(b) a nucleic acid segment of from 17 to about 10,000 nucleotides in length 
that hybridizes to the nucleic acid segment of SEQ ID NO:l or SEQ ID 
NO: 3, or a complement thereof, under standard hybridization 
conditions. 

25. The nucieic acid segment of claim 23, wherein the segment comprises a sequence 
region of at least 17 contiguous nucleotides from SEQ ID NO:l or SEQ ID NO: 3 or a 
complement thereof. 

26. The nucleic acid segment of claim 23, wherein the segment hybridizes to the nucleic 
acid segment of SEQ ID NO:l or SEQ ID NO: 3 or a complement thereof. 

27. The nucleic acid segment of claim 23, wherein the segment comprises a sequence 
region of at least 17 contiguous nucleotides from SEQ ID NO:l, or the complement thereof; 
or wherein the segment hybridizes to the nucleic acid segment of SEQ ID NO:l, or the 
complement thereof. 

28. The nucleic acid segment of claim 23, wherein the segment comprises a sequence 
region of at least 17 contiguous nucleotides from SEQ ID NO: 3, or the complement thereof; 
or wherein the segment hybridizes to the nucleic acid segment of SEQ ID NO: 3, or the 
complement thereof. 

29. The nucleic acid segment of claim 23, wherein the segment comprises a sequence 
region of at least about 25 nucleotides; or wherein the segment is about 25 nucleotides in 
length. 
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30. The nucleic acid segment of claim 28, wherein the segment comprises a sequence 
region of at least about 50 nucleotides; or wherein the segment is about 50 nucleotides in 
length. 

31. The nucHc acid segment of claim 29, wherein the segment comprises a sequence 
region of at least about 100 nucleotides; or wherein the segment is about 100 nucleotides in 
length. 

32. The nucleic acid segment of claim 30, wherein the segment comprises a sequence 
region that consists of the 5499 contiguous nucleotides of SEQ ID NO:l, or the complement 
thereof. 

33. The nucleic acid segment of claim 30, wherein the segment comprises a sequence 
region that" consistsTof the 3753 contiguous nucleotides of SEQ ID NO:3, or the complement 
thereof. 

34. The nucleic acid segment of claim 23, wherein the segment is up to about 10,000 
basepairs in length. 

35. The nucleic acid segment of claim 33, wherein the segment is up to 5,000 basepairs 
in length. 

36. The nucleic acid segment of claim 23, further defined as a RNA segment. 
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37. A purified antibody that binds to an LTBP-3 protein or peptide. 



38. The antibody of claim 37, wherein the antibody is linked to a detectable label. 



39. An immunodetection kit comprising, in suitable container means, an LTBP-3 protein 
or peptide, or a first antibody that binds to an LTBP-3 protein or peptide, and an 
immunodetection reagent. 



40. A method for identifying a transforming growth factor 0 protein in a sample, 
comprising contacting said sample with an LTBP-2 or LTBP-3 protein under conditions 
effective to allow binding and detecting the protein so bound. 



41. A method for purifying transforming growth factor 0 (TGF/8) protein in a sample, 
comprising: 

(a) contacting a sample suspected of containing TGF/S 0 protein with an 
LTBP-2 or LTBP-3 protein under conditions effective to allow specific 
binding; and 

(b) collecting the bound TGF/3, substantially free from the non-bound 
components. 
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42. 



A composition comprising a purified murine LTBP-2 or LTBP-3 polypeptide. 



a 
m 
m 

m 
m 
m 
m 

a 
m 

ru 
o 

o 



-126- 



ABSTRACT OF THE DISCLOSURE 



Disclosed are novel nucleic acid and peptide compositions comprising latent TGFjS 
binding proteins (LTBPs). Also disclosed are methods of using LTBP-2 and LTBP-3 
peptides and the DNA segments which encode them. 
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PLTTRLTRQ 

TGG GGT GCC CGG TGC CAG CGC TGC CCG 
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TCT CGC CCC TCC CCA CCT ACC TTC CAC 
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CGA AGT GCA GTG GAG ATC GCC CCC ACT 
RSAVEIAPT 

TTG AAC CAG AAT ATC TGT GGC CAT GGA 
LNQNICGHG 

TCC TGC CAC TGC AAC GCT GGC TAC CGG 
SCHCNAGYR 

GAT GTG AAC GAG TGC GAG GCA GAG CCC 
DVNECEAEP 

AAC ACT GGT GGC TCC TAC AAT TGT CAC 
NTGGSYNCH 

GGT GCA GGG GGC CGC TCG TGC GTG GAC 
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TGT GGG GAC GGT GGC TTC TGC ATC AAC 
CGDGGFCIN 

TAT CCT GGC TAC CGG CTC AAG GCC TCC 
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NPGS YRCVC 360 
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CCA GTG AGT GAG GAG CGA TCG GTG CAG 1512 
PVSEERSVQ 504 

CCC CCC CGG CCT TAC CCA GAG CTC ATC 1566 
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AGC TTC AAG TGC ATC GCC TGC CAG CCT GGC TAC CGT AGC CAG GGG GGC GGG GCC 2160 
SFKCIACQPGYRSQGGGA 720 



TGT CGT GAT GTC AAC GAA TGC TCC GAA 
CRDVNECSE 

GAG AAA CTT CCG GGT TCT TAC CGT TGC 
EKLPGSYRC 

ACA GGA CGC CTC AGT TGC ATA GAC GTG 
TGRLSCIDV 

CAA GAT GGC ATC TGC ACG AAC ACA CCA 
QDGICTNTP 

GGC TAT CAT CTG TCA AGG GAT CGG AGC 
GYHLSRDRS 

TTC CCT GCG GCC TGC ATC GGG GGT GAC 
FPAACIGGD 

TGT CTC TGT CCC CTG GGT CAT CGG TTG 
CLCPLGHRL 

ATA GAT GAG TGC AGC CAG GAC CCA GGC 
IDECSQDPG 

CTC CAG GGC TCC TAT GTC TGT~GTC TGT 
LQGSYVCVC 

CAG CAT GGG TGT GAG GAG GTG GAG CAG 
QHGCEEVEQ 

AAC TTC GAT GAC ACA GTG TTC TGT GAC 
NFDDTVFCD 

CAG GAA TGC TGT TGC TCT CTG GGA GCT 
QECCCSLGA 

CCC TGT CCA GTC TAC AGC TCA GCC GAA 
PCPVYSSAE 

AGG CTA CAC TCA GGA CAA CAA CAT TGT 
RLHSGQQHC 

ATC GAC GAA TGC ATA TTG TTT GGG GCA 
IDECILFGA 

AAC TCG CAG CCC GGC TAC GAG TGC TAC 
NSQPGYECY 

AAC CTG CTG GAG TGC GTG GAC GTG GAC 
NLLECVDVD 

AAC GGA GTG TGT GAG AAC ACG TGG CGG 
NGVCENTWR 

GAG TAC AGT CCC GCA CAG GCC CAG TGT 
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ASSIGNMENT: Concurrently filed 



PATENT 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Application of: 
Jeffrey Bonadio 
• Wushan Yin 

Serial No.: 08/479,722 

Filed: June 7, 1995 



For: LATENT TGFp BINDING PROTEIN 
(LTBP) GENES, COMPOSITIONS 
AND METHODS 




§ 
§ 
§ 
§ 
§ 
§ 



Examiner: David L. Fitzgerald 
Group Art Unit: 1646 
Atty.Dkt: 4100.000500 
Formerly: UMIC:013 



ELECTION UNDER 37 C.F.R. §§ 3.71 AND 3.73 
AND POWER OF ATTORNEY 



Assistant Commissioner for Patents 
Washington, D.C. 20231 



Sir: 



The undersigned, being Assignee of record of the entire interest in the above-identified 
application by virtue of an assignment recorded in the United States Patent and Trademark 
Office as set forth below, hereby elects, under 37 C.F.R. § 3.71, to prosecute the application 
to the exclusion of the inventor. 

The Assignee hereby revokes any previous Powers of Attorney and appoints: 

Danny L, Williams, Reg. No. 31,892; Terry D. Morgan, Reg. No, 31,181; J. 
Mike Amerson, Reg. No. 35,426; Kenneth D. Goodman, Reg. No. 30,460; 
Barbara S. Kitchell, Reg. No. 33,928; Jeffrey A. Pyle, Reg. No. 34,904; 
Randall C. Furlong, Reg. No. 35,144; Scott F. Diring, Reg. No. 35,119; 
Jeffrey E. Hundley, Reg. No. 42,676; George J. Oehling, Reg. No. 40,471; 
Shelley P.M. Fussey, Reg. No. 39,458; David W. Hibler, Reg. No. 41,071; 
and Mark D. Moore, Reg. No. 42,903. 

each an attorney or agent with the law firm of WILLIAMS, MORGAN & AMERSON, P.C., as 
its attorney or agent so long as they remain with such law firm, with full power of substitution 
and revocation, to prosecute the application, to make alterations and amendments therein, to 
transact all business in the Patent and Trademark Office in connection therewith, and to receive 
any Letters Patent, and for one year after issuance of such Letters Patent to file any request for a 
certificate of correction that may be deemed appropriate. 



New Power of Attorney 



Pursuant to 37 C.F.R. § 3.73, the undersigned has reviewed the evidentiary documents, 
specifically the Assignment to The Regents of The University of Michigan, referenced 
below, and certifies that to the best of my knowledge and behalf, title remains in the name of 

the Assignee. 

Please direct all communications as follows: 

Shelley P.M. Fussey 

WILLIAMS, MORGAN & AMERSON, P.C. 
7676 Hillmont, Suite 250 
Houston, Texas 77040 
(713) 934-7000 
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JUN 1 2 20QG 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Bonadio, Jeffrey 
Yin, Wushan 

(ii) TITLE OF INVENTION: LATENT TGF ( BINDING PROTEIN (LTBP) 
GENES, COMPOSITIONS AND METHODS 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Williams, Morgan & Amerson 

(B) STREET: 7676 Hillmont, Suite 250 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: USA 

(F) ZIP: 77040 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

fi' (C) -OPERATING SYSTEM: PC-DOS/MS-DOS 

Uj (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

Pfj (vi) CURRENT APPLICATION DATA: 

fff (A) APPLICATION NUMBER: US 08/479, 722 

(B) FILING DATE: 07-JUN-1995 
% (C) CLASSIFICATION: Unknown 



^ (vii) PRIOR APPLICATION DATA: 

y (A) APPLICATION NUMBER: US PCT/US95 / 022 51 

Si (B) FILING DATE: 21-FEB-1995 

ft) (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/316,650 

(B) FILING DATE: 30-SEP-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/199,780 

(B) FILING DATE: 18-FEB-1994 

(viii) ATTORNEY/AGENT INFORMATION: 

(A} NAME: Fussey, Shelley P.M. 

(B) REGISTRATION NUMBER: 39,458 

(C) REFERENCE/ DOCKET NUMBER: 4 100 . 000500/FUS 

(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (713) 934-7000 

(B) TELEFAX: (713) 934-7011 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5499 base pairs 



(B) 
(C) 
(D) 



TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



Hi 



(A) DESCRIPTION: /desc = "DNA" 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B* T OCATION: 1..54 99 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG GAG AGC ACC TCC CCG CGA GGT CTC CGG TGC CCA CAG CTC TGC AGC 48 
Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gin Leu Cys Ser 
15 10 15 

CAC TCT GGC GCC ATG AGA GCG CCG ACC ACC GCT CGC TGC TCC GGA TGC 96 
His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys 
20 25 30 

ATC CAA CGG GTG CGT TGG AGG GGC TTC CTG CCA CTT GTC CTG GCT GTC 14 4 

lie Gin Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val 
35 40 45 

TTG ATG GGG ACA AGT CAT GCC CAA CGG GAT TCC ATA GGG AGA TAC GAA 192 
Leu Met Gly Thr Ser His Ala Gin Arg Asp Ser lie Gly Arg Tyr Glu 
50 55 60 

CCA GCT AGC AGG GAT GCG AAT CGG TTG TGG CAC CCC GTG GGC AGC CAC 240 
Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His 
65 70 75 80 

CCC GCA GCG GCT GCA GCC AAG GTG TAC AGT CTG TTC CGA GAG CCT GAC 28 8 

Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp 
85 90 95 

GCG CCG GTC CCC GGC TTG TCG CCC TCT GAG TGG AAC CAG CCG GCC CAG 336 
Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gin Pro Ala Gin 
100 105 110 

GGG AAC CCG GGA TGG CTC GCA GAG GCC GAG GCC AGG AGG CCA CCT CGA 38 4 

Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg 
115 120 125 

ACC CAG CAG CTG CGT CGA GTC CAG CCA CCT GTC CAG ACT CGG AGA AGC 4 32 

Thr Gin Gin Leu Arg Arg Val Gin Pro Pro Val Gin Thr Arg Arg Ser 
130 135 140 

CAT CCC CGG GGC CAG CAG CAG ATA GCA GCC CGG GCT GCA CCT TCT GTC 480 
His Pro Arg Gly Gin Gin Gin He Ala Ala Arg Ala Ala Pro Ser Val 
145 150 155 160 

GCG CGC CTG GAA ACC CCT CAG CGA CCC GCG GCT GCA CGG CGA GGG CGG 528 
Ala Arg Leu Glu Thr Pro Gin Arg Pro Ala Ala Ala Arg Arg Gly Arg 

165 170 175 

CTC ACT GGG AGA AAT GTC TGC GGG GGA CAG TGC TGC CCA GGA TGG ACA 57 6 



Leu Thr Gly Arg Asn Val Cys Gly Gly Gin Cys Cys Pro Gly Trp Thr 
180 185 190 



ACA TCA AAC AGC ACC AAC CAC TGT ATC AAA CCT GTG TGT CAG CCT CCC 62 4 

Thr Ser Asn Ser Thr Asn His Cys He Lys Pro Val Cys Gin Pro Pro 
195 200 205 

TGT CAG AAC CGA GGC TCC TGC AGC AGG CCC CAG GTC TGC ATC TGC CGT 672 
Cys Gin Asn Arg Gly Ser Cys Ser Arg Pro Gin Val Cys He Cys Arg 
210 215 220 

TCT GGC TTC CGT GGG GCG CGC TGT GAG GAG GTC Arc CCT GAG GAG GAA 720 
Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val He Pro Glu Glu Glu 
225 230 235 240 

TTT GAC CCT CAG AAT GCC AGG CCT GTG CCC AGA CGC TCA GTG GAG AGA 7 68 

Phe Asp Pro Gin Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg 
245 250 255 

GCA CCC GGT CCT CAC AGA AGC AGT GAG GCC AGA GGA AGT CTA GTG ACC 816 
Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr 
260 265 270 

AGA ATA CAG CCG CTG GTA CCA CCA CCA TCA CCA CCT CCA TCT CGG CGC 8 64 

Arg He Gin Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg 
275 280 285 

CTC AGC CAG CCC TGG CCC CTG CAG CAG CAC TCA GGG CCG TCC AGG ACA 912 
Leu Ser Gin Pro Trp Pro Leu Gin Gin His Ser Gly Pro Ser Arg Thr 
290 295 300 

GTT CGT CGG TAT CCG GCC ACT GGT GCC AAT GGC CAG CTG ATG TCC AAC 960 
Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gin Leu Met Ser Asn 
yl 305 310 315 320 

GCT TTG CCT TCA GGA CTC GAG CTG AGA GAC AGC AGC CCA CAG GCA GCA 1008 
Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gin Ala Ala 
325 330 335 

CAT GTG AAC CAT CTC TCA CCC CCC TGG GGG CTG AAC CTC ACC GAG AAA 1056 
His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys 
340 345 350 

ATC AAG AAA ATC AAA GTC GTC TTC ACC CCC ACC ATC TGC AAG CAG ACC 1104 
He Lys Lys He Lys Val Val Phe Thr Pro Thr He Cys Lys Gin Thr 
355 360 365 

TGT GCC CGG GGA CGC TGT GCC AAC AGC TGT GAG AAG GGT GAC ACC ACC 1152 
Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr 
370 375 380 

ACC TTG TAC AGT CAG GGT GGC CAT GGG CAT GAC CCC AAG TCT GGC TTC 1200 
Thr Leu Tyr Ser Gin Gly Gly His Gly His Asp Pro Lys Ser Gly Phe 
385 390 395 400 

CGT ATC TAT TTC TGC CAA ATC CCC TGC CTG AAT GGT GGC CGC TGC ATC 124 8 

Arg He Tyr Phe Cys Gin He Pro Cys Leu Asn Gly Gly Arg Cys He 



si 



405 



410 



415 



GGC CGG GAC GAG TGC TGG TGT CCA GCC AAC TCC ACA GGA AAG TTC TGC 1296 
Gly Arg Asp Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys 
420 425 430 

CAT CTG CCT GTC CCG CAG CCA GAC AGG GAA CCT GCA GGG CGA GGT TCC 134 4 

His Leu Pro Val Pro Gin Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser 
435 440 445 

CGG CAC AGA ACC CTG CTG GAA GGT CCC CTG AAG CAA TCC ACC TTC ACG 1392 
Arg His Arg Thr ueu Leu Glu Gly Pro Leu Lys Gin Ser Thr Phe Thr 
450 455 460 

CTG CCT CTC TCT AAC CAG CTC GCC TCT GTG AAC CCC TCG CTG GTG AAG 14 4 0 

Leu Pro Leu Ser Asn Gin Leu Ala Ser Val Asn Pro Ser Leu Val Lys 
465 470 475 480 

GTG CAA ATT CAT CAC CCG CCT GAG GCC TCT GTG CAG ATT CAC CAG GTG 1488 
Val Gin He His His Pro Pro Glu Ala Ser Val Gin He His Gin Val 
485 490 495 

^ GCC CGG GTC CGG GGT GAG CTG GAC CCC GTG CTG GAG GAC AAC AGT GTG 1536 

Ala Arg Val Arg Gly Glu Leu Asp Pro Val Leu Glu Asp Asn Ser Val 

^ 500 505 510 

ftj GAG ACC AGA GCC TCT CAT CGC CCC CAC GGC AAC CTA GGC CAC AGC CCC 1584 

01 Glu Thr Arg Ala Ser His Arg Pro His Gly Asn Leu Gly His Ser Pro 
m 515 520 525 



yi 



m 



TGG GCC AGC AAC AGC ATA CCC GCT CGG GCC GGA GAG GCC CCT CGG CCA 1632 
Trp Ala Ser Asn Ser He Pro Ala Arg Ala Gly Glu Ala Pro Arg Pro 
530 535 540 



CCA CCA GTG CTG TCT AGG CAT TAT GGA CTT CTC GGC CAG TGT TAC CTG 168 0 

ftj Pro Pro Val Leu Ser Arg His Tyr Gly Leu Leu Gly Gin Cys Tyr Leu 
0 545 550 555 560 

U 

AGC ACG GTG AAT GGA CAG TGT GCT AAC CCC CTA GGT AGT CTG ACT TCT 172 8 

Ser Thr Val Asn Gly Gin Cys Ala Asn Pro Leu Gly Ser Leu Thr Ser 
565 570 575 

CAG GAG GAC TGC TGT GGC AGT GTG GGG ACC TTC TGG GGG GTG ACC TCC 177 6 

Gin Glu Asp Cys Cys Gly Ser Val Gly Thr Phe Trp Gly Val Thr Ser 
580 585 590 

TGT GCT CCC TGC CCA CCC AGA CAA GAG GGT CCA GCC TTC CCA GTG ATT 18 2 4 

Cys Ala Pro Cys Pro Pro Arg Gin Glu Gly Pro Ala Phe Pro Val He 

595 600 605 

GAA AAT GGC CAG CTG GAG TGT CCC CAA GGA TAC AAG AGA CTG AAC CTC 1872 

Glu Asn Gly Gin Leu Glu Cys Pro Gin Gly Tyr Lys Arg Leu Asn Leu 

610 615 620 

AGC CAC TGC CAA GAT ATC AAT GAG TGC CTG ACC CTG GGC CTC TGC AAG 1920 

Ser His Cys Gin Asp He Asn Glu Cys Leu Thr Leu Gly Leu Cys Lys 
625 630 635 640 



GAC TCG GAG TGC GTG AAC ACC AGG GGC AGC TAC CTG TGC ACC TGC AGG 1968 
Asp Ser Glu Cys Val Asn Thr Arg Gly Ser Tyr Leu Cys Thr Cys Arg 
645 650 655 

CCT GGC CTC ATG CTG GAT CCG TCA AGG AGC CGC TGC GTA TCG GAC AAG 2016 
Pro Gly Leu Met Leu Asp Pro Ser Arg Ser Arg Cys Val Ser Asp Lys 
660 665 670 

GCT GTC TCC ATG CAG CAG GGA CTA TGC TAC CGG TCA CTG GGG TCT GGT 2064 
Ala Val Ser Met Gin Gin Gly Leu Cys Tyr Arg Ser Leu Gly Ser Gly 
675 680 685 

ACC TGC ACC CTG CCT TTG GTT CAT CGG ATC ACC AAG CAG ATA TGC TGC 2112 
Thr Cys Thr Leu Pro Leu Val His Arg lie Thr Lys Gin lie Cys Cys 
690 695 700 

TGC AGC CGT GTG GGC AAA GCC TGG GGT AGC ACA TGT GAA CAG TGT CCC 2160 
Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gin Cys Pro 
705 710 715 720 

CTG CCT GGC ACA GAA GCC TTC AGG GAG ATC TGC CCT GCT GGC CAT GGC 22 08 

Leu Pro Gly Thr Glu Ala Phe Arg Glu lie Cys Pro Ala Gly His Gly 
" 725 730 735 

TAC ACC TAC TCG AGC TCA GAC ATC CGC CTG TCT ATG AGG AAA GCC GAA 225 6 

Tyr Thr Tyr Ser Ser Ser Asp He Arg Leu Ser Met Arg Lys Ala Glu 
740 745 750 

GAA GAG GAA CTG GCT AGC CCC TTA AGG GAG CAG ACA GAG CAG AGC ACT 2304 
Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gin Thr Glu Gin Ser Thr 
755 760 765 

GCA CCC CCA CCT GGG CAA GCA GAG AGG CAA CCA CTC CGG GCA GCC ACC 2352 
Ala Pro Pro Pro Gly Gin Ala Glu Arg Gin Pro Leu Arg Ala Ala Thr 
770 775 780 

GCC ACC TGG ATT GAG GCT GAG ACC CTC CCT GAC AAA GGT GAC TCT CGG 2400 
Ala Thr Trp He Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg 
785 790 795 800 

GCT GTT CAG ATC ACA ACC AGT GCT CCC CAC CTA CCT GCC CGG GTA CCA 244 8 

Ala Val Gin He Thr Thr Ser Ala Pro His Leu Pro Ala Arg Val Pro 
805 810 815 

GGG GAT GCC ACT GGA AGA CCA GCA CCA TCC TTG CCT GGA CAG GGC ATT 24 9 6 

Gly Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gin Gly He 
820 825 830 

CCA GAG AGT CCA GCA GAA GAG CAA GTG ATT CCC TCC AGT GAT GTC TTG 254 4 

Pro Glu Ser Pro Ala Glu Glu Gin Val He Pro Ser Ser Asp Val Leu 
835 840 845 

GTG ACA CAC AGC CCC CCA GAC TTT GAT CCA TGT TTT GCT GGA GCC TCC 2592 
Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser 
850 855 860 



AAC ATC TGT GGC CCT GGG ACC TGT GTG AGC CTC CCA AAT GGA TAC AGA 2640 
Asn lie Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg 
865 870 875 880 

TGT GTC TGC AGC CCT GGC TAC CAG CTA CAC CCC AGC CAA GAC TAC TGT 2688 
Cys Val Cys Ser Pro Gly Tyr Gin Leu His Pro Ser Gin Asp Tyr Cys 
885 890 895 

ACT GAT GAC AAC GAG TGT ATG AGG AAC CCC TGT GAA GGA AGA GGG CGC 2736 
Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg 
900 905 910 

TGT GTC AAC AGT GTG GGC TCC TAC TCC TGC CTC TGC TAT CCT GGC TAC 2784 
Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr 
915 920 925 

ACA CTA GTC ACC CTC GGA GAC ACA CAG GAG TGC CAA GAT ATC GAT GAG 2832 
Thr Leu Val Thr Leu Gly Asp Thr Gin Glu Cys Gin Asp He Asp Glu 
930 935 940 

^GT GAG CAG CCC GGG GTG TGC AGT GGT GGG CGA TGC AGC AAC ACG GAG 2880 
^ Cys Glu Gin Pro Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu 
^ 945 950 955 960 

Ul GGC TCG TAC CAC TGC GAG TGT GAT CGG GGC TAC ATC ATG GTC AGG AAA 2928 
yy Gly Ser Tyr His Cys Glu Cys Asp Arg Gly Tyr He Met Val Arg Lys 

ft! 965 970 975 

1^ GGA CAC TGT CAA GAT ATC AAC GAA TGC CGT CAC CCT GGT ACC TGC CCT 297 6 

Hi Gly His Cys Gin Asp He Asn Glu Cys Arg His Pro Gly Thr Cys Pro 

m 980 985 990 

C GAT GGG AGA TGC GTC AAC TCC CCT GGC TCC TAC ACT TGT CTG GCC TGT 3024 

Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys 

fa* 995 1000 1005 

ft I 

fj GAG GAG GGC TAT GTA GGC CAG AGT GGG AGC TGT GTA GAT GTC AAT GAG 3072 

% Glu Glu Gly Tyr Val Gly Gin Ser Gly Ser Cys Val Asp Val Asn Glu 

*** 1010 1015 1020 

TGT CTG ACC CCT GGG ATA TGT ACC CAT GGA AGG TGC ATC AAC ATG GAA 3120 
Cys Leu Thr Pro Gly He Cys Thr His Gly Arg Cys He Asn Met Glu 
1025 1030 1035 1040 

GGC TCC TTT AGA TGC TCC TGT GAG CCG GGC TAT GAG GTC ACC CCA GAC 3168 
Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp 
1045 1050 1055 

AAG AAG GGC TGC CGA GAT GTG GAC GAG TGT GCC AGC CGA GCC TCG TGC 3216 
Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys 
1060 1065 1070 

CCC ACG GGC CTC TGC CTC AAC ACG GAG GGC TCC TTC ACC TGC TCA GCC 3264 
Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala 
1075 1080 1085 

TGT CAG AGC GGG TAC TGG GTG AAC GAA GAT GGC ACT GCC TGT GAA GAC 3312 



Cys Gin Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr Ala Cys Glu Asp 
1090 1095 1100 



TTG GAT GAA TGT GCC TTC CCT GGA GTC TGC CCC ACA GGC GTC TGC ACC 3360 
Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr 
1105 1110 1115 1120 

AAT ACT GTA GGC TCC TTC TCC TGC AAG GAC TGT GAC CAG GGC TAC CGG 34 08 

Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gin Gly Tyr Arg 
1125 1130 1135 

CCC AAC CCC CTG GGC AAC AGA TGC GAA GAT GTG GAT GAG TGT GAA GGT 345 6 

Pro Asn Pro Leu Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly 
1140 1145 " 1150 

CCC CAA AGC AGC TGC CGG GGA GGC GAA TGC AAG AAC ACA GAA GGT TCC 3504 
Pro Gin Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser 
1155 1160 1165 

TAC CAA TGC CTC TGT CAC CAG GGC TTC CAG CTG GTC AAT GGC ACC ATG 3552 
Tyr Gin Cys Leu Cys His Gin Gly Phe Gin Leu Val Asn Gly Thr Met 
m 1170 1175 1180 

Cj TGT GAG GAC GTG AAT GAG TGT GTT GGG GAA GAG CAT TGT GCT CCT CAC 3 600 

US Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His Cys Ala Pro His 
tfl H 85 1190 1195 1200 

GGC GAG TGC CTC AAC AGC CTG GGC TCC TTC TTC TGC CTC TGT GCA CCC 364 8 

ffi Gly Glu Cys Leu Asn Ser Leu Gly Ser Phe Phe Cys Leu Cys Ala Pro 
% 1205 1210 1215 

^ GGC TTT GCT AGT GCT GAG GGG GGC ACC AGA TGC CAG GAT GTT GAT GAA 3 696 

Q Gly Phe Ala Ser Ala Glu Gly Gly Thr Arg Cys Gin Asp Val Asp Glu 
01 1220 1225 1230 

}'fj TGT GCA GCC ACA GAC CCG TGT CCG GGA GGA CAC TGT GTC AAC ACA GAG 3744 

^ Cys Ala Ala Thr Asp Pro Cys Pro Gly Gly His Cys Val Asn Thr Glu 
p t 1235 1240 1245 

GGC TCC TTC AGC TGT CTG TGT GAG ACT GCT TCC TTC CAG CCC TCC CCA 37 92 

Gly Ser Phe Ser Cys Leu Cys Glu Thr Ala Ser Phe Gin Pro Ser Pro 
1250 1255 1260 

GAC AGC GGA GAA TGT TTG GAT ATT GAT GAG TGT GAG GAC CGT GAA GAC 3840 
Asp Ser Gly Glu Cys Leu Asp He Asp Glu Cys Glu Asp Arg Glu Asp 
1265 1270 1275 1280 

CCG GTG TGC GGA GCC TGG AGG TGT GAG AAC AGT CCT GGT TCC TAC CGC 388 8 

Pro Val Cys Gly Ala Trp Arg Cys Glu Asn Ser Pro Gly Ser Tyr Arg 
1285 1290 1295 

TGC ATC CTG GAC TGC CAG CCT GGA TTC TAT GTG GCG CCA AAT GGA GAC 3936 
Cys He Leu Asp Cys Gin Pro Gly Phe Tyr Val Ala Pro Asn Gly Asp 
1300 1305 1310 



TGC ATT GAC ATA GAT GAA TGT GCC AAT GAC ACT GTG TGT GGG AAC CAT 
Cys He Asp He Asp Glu Cys Ala Asn Asp Thr Val Cys Gly Asn His 



3984 



1315 



1320 



1325 



-as-;;. 



GGC TTC TGT GAC AAC ACG GAC GGC TCC TTC CGC TGC CTG TGT GAC CAG 4032 
Gly Phe Cys Asp Asn Thr Asp Gly Ser Phe Arg Cys Leu Cys Asp Gin 
1330 1335 1340 

GGC TTC GAG ACC TCA CCA TCA GGC TGG GAG TGT GT^ GAT GTG AAC GAG 4 080 

Gly Phe Glu Thr Ser Pro Ser Gly Trp Glu Cys Val Asp Val Asn Glu 
1345 1350 1355 1360 

TGT GAG CTC ATG ATG GCA GTG TGT GGG GAT GCG CTC TGT GAG AAC GTG 4128 
Cys Glu Leu Met Met Ala Val Cys Gly Asp Ala .eu Cys Glu Asn Val 
1365 1370 1375 



GAA 


GGC 


TCC 


TTC 


CTG 


TGC 


CTT 


TGC 


GCC 


AGT 


GAC 


CTT 


GAG 


GAG 


TAC 


GAC 


4176 


Glu 


Gly 


Ser 


Phe 


Leu 


Cys 


Leu 


Cys 


Ala 


Ser 


Asp 


Leu 


Glu 


Glu 


Tyr Asp 










1380 








1385 








1390 






GCA 


GAA 


GAA 


GGA 


CAC 


TGC 


CGT 


CCT 


CGG 


GTG 


GCT 


GGA 


GCT 


CAG 


AGA 


ATC 


4224 


Ala 


Glu 


Glu 


Gly 


His 


Cys 


Arg 


Pro 


Arg 


Val 


Ala 


Gly Ala 


Gin 


Arg 


He 








1395 








1400 








1405 








CCA 


GAG 


GTC 


CGG 


ACA 


GAG 


GAC 


CAG 


GCT 


CCA 


AGC 


CTT 


ATC 


CGC 


ATG 


GAA 


4272 


Pro 


Glu 


Val 


Arg 


Thr 


Glu 


Asp 


Gin 


Ala 


Pro 


Ser 


Leu 


He 


Arg 


Met 


Glu 






1410 








1415 








1420 








TGC 


TAC 


TCT 


GAA 


CAC 


AAT 


GGT 


GGT 


CCT 


CCC 


TGC 


TCT 


CAA 


ATC 


CTG 


GGC 


4320 


Cys 


Tyr 


Ser 


Glu 


His 


Asn 


Gly 


Gly 


Pro 


Pro 


Cys 


Ser 


Gin 


He 


Leu 


Gly 




1425 








1430 








1435 








1440 




CAG 


AAC 


TCC 


ACA 


CAG 


GCC 


GAG 


TGC 


TGC 


TGC 


ACT 


CAG 


GGT 


GCC 


AGA 


TGG 


4368 


Gin 


Asn 


Ser 


Thr 


Gin 


Ala 


Glu 


Cys 


Cys 


Cys 


Thr 


Gin 


Gly Ala 


Arg 


Trp 












1445 








1450 








1455 




GGA 


AAG 


GCC 


TGT 


GCG 


CCC 


TGC 


CCA 


TCT 


GAG 


GAC 


TCA 


GTT 


GAA 


TTC 


AGT 


4416 


Gly 


Lys 


Ala 


Cys 


Ala 


Pro 


Cys 


Pro 


Ser 


Glu 


Asp 


Ser 


Val 


Glu 


Phe 


Ser 










1460 








1465 








1470 






CAG 


CTC 


TGC 


CCC 


AGT 


GGT 


CAA 


GGT 


TAC 


ATC 


CCA 


GTG 


GAA 


GGA 


GCC 


TGG 


4464 


Gin 


Leu 


Cys 


Pro 


Ser 


Gly 


Gin 


Gly 


Tyr 


He 


Pro 


Val 


Glu 


Gly 


Ala 


Trp 








1475 








1480 








1485 






ACA 


TTT 


GGA 


CAA 


ACC 


ATG 


TAT 


ACA 


GAT 


GCC 


GAT 


GAA 


TGT 


GTA 


CTG 


TTT 


4512 


Thr 


Phe 


Gly 


Gin 


Thr 


Met 


Tyr 


Thr 


Asp 


Ala 


Asp 


Glu 


Cys 


Val 


Leu 


Phe 






1490 










1495 








1500 










GGG 


CCT 


GCT 


CTC 


TGC 


CAG 


AAT 


GGC 


CGA 


TGC 


TCA 


AAC 


ATA 


GTG 


CCT 


GGC 


4560 


Gly 


Pro 


Ala 


Leu 


Cys 


Gin 


Asn 


Gly 


Arg 


Cys 


Ser 


Asn 


He 


Val 


Pro 


Gly 




1505 










1510 








1515 








1520 




TAC 


ATT 


TGC 


CTG 


TGC 


AAC 


CCT 


GGC 


TAC 


CAC 


TAT 


GAT 


GCC 


TCC 


AGC 


AGG 


4608 


Tyr 


He 


Cys 


Leu 


Cys 


Asn 


Pro 


Gly 


Tyr 


His 


Tyr 


Asp 


Ala 


Ser 


Ser 


Arg 












1525 








1530 








1535 





AAG TGC CAG GAT CAC AAC GAA TGC CAG GAC TTG GCC TGT GAG AAC GGT 
Lys Cys Gin Asp His Asn Glu Cys Gin Asp Leu Ala Cys Glu Asn Gly 
1540 1545 1550 



4656 



GAG TGT GTG AAC CAA GAA GGC TCC TTC CAT TGC CTC TGC AAT CCC CCC 4704 
Glu Cys Val Asn Gin Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro 
1555 1560 1565 

CTC ACC CTA GAC CTC AGT GGG CAG CGC TGT GTG AAC ACG ACC AGC AGC 4752 
Leu Thr Leu Asp Leu Ser Gly Gin Arg Cys Val Asn Thr Thr Ser Ser 
1570 1575 1580 

ACG GAG GAC TTC CCT GAC CAT GAC ATC CAC ATG GAC ATC TGC TGG AAA 4 800 

Thr Glu Asp Phe Pro Asp His Asp lie His Met Asp lie Cys Trp Lys 
1585 1590 1591 1600 

AAA GTC ACC AAT GAT GTG TGC AGC CAG CCC TTG CGT GGG CAC CAT ACC 48 4 8 

Lys Val Thr Asn Asp Val Cys Ser Gin Pro Leu Arg Gly His His Thr 
1605 1610 1615 

ACC TAT ACA GAA TGC TGC TGC CAA GAT GGG GAG GCC TGG AGC CAG CAA 48 96 

Thr Tyr Thr Glu Cys Cys Cys Gin Asp Gly Glu Ala Trp Ser Gin Gin 
1620 1625 1630 

~^ TGC GCT CTG TGC CCG CCC AGG AGC TCT GAG GTC TAC GCT CAG CTG TGC 4 94 4 

lr jf Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gin Leu Cys 

S 1635 - 1640 1645 

i fl 

y9 AAC GTG GCT CGG ATT GAG GCA GAG CGC GGA GCA GGG ATC CAC TTC CGG 4 992 

fy Asn Val Ala Arg lie Glu Ala Glu Arg Gly Ala Gly lie His Phe Arg 
m 1650 1655 1660 



I fl 



CCA GGC TAT GAG TAT GGC CCT GGC CTG GAC GAT CTG CCT GAA AAC CTC 504 0 

Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu 
1665 1670 1675 1680 

TAC GGC CCA GAT GGG GCT CCC TTC TAT AAC TAC CTA GGC CCC GAG GAC 508 8 

Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp 
1685 1690 1695 

ACT GCC CCT GAG CCT CCC TTC TCC AAC CCA GCC AGC CAG CCG GGA GAC 5136 
Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gin Pro Gly Asp 
1700 1705 1710 

AAC ACA CCT GTC CTT GAG CCT CCT CTG CAG CCC TCT GAA CTT CAG CCT 5184 
Asn Thr Pro Vai Leu Glu Pro Pro Leu Gin Pro Ser Glu Leu Gin Pro 
1715 1720 1725 

CAC TAT CTA GCC AGC CAC TCA GAA CCC CCT GCC TCC TTC GAA GGC CTT 5232 
His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu 
1730 1735 1740 

CAG GCT GAG GAA TGT GGC ATC CTG AAT GGC TGT GAG AAT GGC CGC TGC 52 8 0 

Gin Ala Glu Glu Cys Gly lie Leu Asn Gly Cys Glu Asn Gly Arg Cys 
1745 1750 1755 1760 



GTG CGT GTG CGG GAG GGC TAC ACT TGC GAC TGC TTT GAG GGC TTC CAG 
Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gin 
1765 1770 1775 



5328 



CTG GAT GCG CCC ACA TTG GCC TGT GTG GAT GTG AAC GAG TGT GAA GAC 537 6 

Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp 
1780 1785 1790 

TTG AAC GGG CCT GCA CGA CTC TGT GCA CAC GGT CAC TGT GAG AAC ACA 5424 
Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr 
1795 1800 1805 

GAG GGT TCC TAT CGC TGC CAC TGT TCG CCA GGT TAC GTG GCA GAG CCA 5472 
Glu Gly Ser ~* - \xq Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pro 
1810 1815 1820 

GGC CCC CCA CAC TGT GCG GCC AAG GAG 5499 
Gly Pro Pro His Cys Ala Ala Lys Glu 
1825 1830 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1833 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gin Leu Cys Ser 

S 1 5 10 15 

U ii 

His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys 
20 25 30 

jjat lie Gin Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val 

fjjj 35 4 0 4 5 

■SSSSj 

2 Leu Met Gly Thr Ser His Ala Gin Arg Asp Ser He Gly Arg Tyr Glu 
^ 50 55 60 

Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His 
65 70 75 80 

Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp 
85 90 95 

Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gin Pro Ala Gin 
100 105 110 

Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg 
115 120 125 

Thr Gin Gin Leu Arg Arg Val Gin Pro Pro Val Gin Thr Arg Arg Ser 
130 135 140 

His Pro Arg Gly Gin Gin Gin He Ala Ala Arg Ala Ala Pro Ser Val 
145 150 155 160 



Ala Arg Leu Glu Thr Pro Gin Arg Pro Ala Ala Ala Arg Arg Gly Arg 
165 170 175 



Leu Thr Gly Arg Asn Val Cys Gly Gly Gin Cys Cys Pro Gly Trp Thr 
180 185 190 

Thr Ser Asn Ser Thr Asn His Cys lie Lys Pro Val Cys Gin Pro Pro 
195 200 205 

Cys Gin Asn Arg Gly Ser Cys Ser Arg Pro Gin Val Cys lie Cys Arg 
210 215 210 

Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val He Pro Glu Glu Glu 
225 230 235 240 

Phe Asp Pro Gin Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg 
245 250 255 

Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr 
260 265 270 

Arg He Gin Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg 
275 - 280 285 

Leu Ser Gin Pro Trp Pro Leu Gin Gin His Ser Gly Pro Ser Arg Thr 
290 295 300 

Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gin Leu Met Ser Asn 
305 310 315 320 

Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gin Ala Ala 
325 330 335 

His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys 
340 345 350 

He Lys Lys He Lys Val Val Phe Thr Pro Thr He Cys Lys Gin Thr 
355 360 365 

Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr 
370 375 380 

Thr Leu Tyr Ser Gin Gly Gly His Gly His Asp Pro Lys Ser Gly Phe 
385 390 395 400 

Arg He Tyr Phe Cys Gin He Pro Cys Leu Asn Gly Gly Arg Cys He 
405 410 415 

Gly Arg Asp Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys 
420 425 430 



His Leu Pro Val Pro Gin Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser 
435 440 445 



Arg His Arg Thr Leu Leu Glu Gly Pro Leu Lys Gin Ser Thr Phe Thr 
450 455 460 



Leu Pro Leu Ser Asn Gin Leu Ala Ser Val Asn Pro Ser Leu Val Lys 
465 470 475 480 



m 



Val Gin He His His Pro Pro Glu Ala Ser Val Gin He His Gin Val 
485 490 495 

Ala Arg Val Arg Gly Glu Leu Asp Pro Val Leu Glu Asp Asn Ser Val 
500 505 510 

Glu Thr Arg Ala Ser His Arg Pro His Gly Asn Leu Gly His Ser Pro 
515 520 525 

Trp Ala Ser Asn Ser He Pro Ala Arg Ala Gly Glu Ala Pro Arg Pro 
530 535 540 

Pro Pro Val Leu Ser Arg His Tyr Gly Leu Leu Gly Gin Cys Tyr Leu 
545 550 555 560 

Ser Thr Val Asn Gly Gin Cys Ala Asn Pro Leu Gly Ser Leu Thr Ser 
565 570 575 

Gin Glu Asp Cys Cys Gly Ser Val Gly Thr Phe Trp Gly Val Thr Ser 
580 585 590 

Cys Ala Pro Cys Pro Pro Arg Gin Glu Gly Pro Ala Phe Pro Val He 
595 600 605 

Glu Asn Gly Gin Leu Glu Cys Pro Gin Gly Tyr Lys Arg Leu Asn Leu 
610 615 620 

Ser His Cys Gin Asp He Asn Glu Cys Leu Thr Leu Gly Leu Cys Lys 
625 630 635 640 

Asp Ser Glu Cys Val Asn Thr Arg Gly Ser Tyr Leu Cys Thr Cys Arg 
645 650 655 

Pro Gly Leu Met Leu Asp Pro Ser Arg Ser Arg Cys Val Ser Asp Lys 
660 665 670 

Ala Val Ser Met Gin Gin Gly Leu Cys Tyr Arg Ser Leu Gly Ser Gly 
675 680 685 

Thr Cys Thr Leu Pro Leu Val His Arg He Thr Lys Gin He Cys Cys 
690 695 700 

Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gin Cys Pro 
705 710 715 720 

Leu Pro Gly Thr Glu Ala Phe Arg Glu He Cys Pro Ala Gly His Gly 
725 730 735 



Tyr Thr Tyr Ser Ser Ser Asp He Arg Leu Ser Met Arg Lys Ala Glu 
740 745 750 



Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gin Thr Glu Gin Ser Thr 
755 760 765 



Ala Pro Pro Pro Gly Gin Ala Glu Arg Gin Pro Leu Arg Ala Ala Thr 
770 775 780 



Ala Thr Trp lie Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg 
785 790 795 800 

Ala Val Gin lie Thr Thr Ser Ala Pro His Leu Pro Ala Arg Val Pro 
805 810 815 

Gly Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gin Gly lie 
820 825 830 

Pro Glu Ser Pro Ala Glu Glu Gin Val lie Pro Ser Ser Asp Val Leu 
835 840 845 

Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser 
850 855 860 

Asn lie Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg 
865 870 875 880 

% Cys Val Cys Ser Pro Gly Tyr Gin Leu His Pro Ser Gin Asp Tyr Cys 

- 885 890 895 

111 

iD Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg 

flj 900 905 910 

f-p Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr 
[p 915 920 925 

L Thr Leu Val Thr Leu Gly Asp Thr Gin Glu Cys Gin Asp He Asp Glu 
H 930 935 940 

H Cys Glu Gin P-o Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu 

flf 945 950 955 960 

fj Gly Ser Tyr His Cys Glu Cys Asp Arg Gly Tyr He Met Val Arg Lys 

9 65 97 0 97 5 

Gly His Cys Gin Asp He Asn Glu Cys Arg His Pro Gly Thr Cys Pro 
980 985 990 

Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys 
995 1000 1005 

Glu Glu Gly Tyr Val Gly Gin Ser Gly Ser Cys Val Asp Val Asn Glu 
1010 1015 1020 

Cys Leu Thr Pro Gly He Cys Thr His Gly Arg Cys He Asn Met Glu 
1025 1030 1035 1040 

Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp 
1045 1050 1055 



Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys 
1060 1065 1070 



Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala 
1075 1080 1085 



Cys Gin Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr h±a Cys Glu Asp 
1090 1095 1100 

Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr 
1105 1110 1115 1120 

Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gin Gly Tyr Arg 
1125 1130 1135 

Pro Asn Pro Leu Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly 
1140 1145 1150 

Pro Gin Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser 
1155 1160 1165 

Tyr Gin Cys Leu Cys His Gin Gly Phe Gin Leu Val Asn G^y Thr Met 
1170 1175 1180 

Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His Cys Ala Pro His 
1185 * 1190 1195 1200 

Gly Glu Cys Leu Asn Ser Leu Gly Ser Phe Phe Cys Leu Cys Ala Pro 
1205 1210 1215 

Gly Phe Ala Ser Ala Glu Gly Gly Thr Arg Cys Gin Asp Val Asp Glu 
1220 1225 1230 

Cys Ala Ala Thr Asp Pro Cys Pro Gly Gly His Cys Val Asn Thr Glu 
1235 1240 1245 

Gly Ser Phe Ser Cys Leu Cys Glu Thr Ala Ser Phe Gin Pro Ser Pro 
1250 1255 1260 

Asp Ser Gly Glu Cys Leu Asp lie Asp Glu Cys Glu Asp Arg Glu Asp 
1265 1270 1275 1280 

Pro Val Cys Gly Ala Trp Arg Cys Glu Asn Ser Pro Gly Ser Tyr Arg 
1285 1290 1295 

Cys lie Leu Asp Cys Gin Pro Gly Phe Tyr Val Ala Pro Asn Gly Asp 
1300 1305 1310 

Cys lie Asp lie Asp Glu Cys Ala Asn Asp Thr Val Cys Gly Asn His 
1315 1320 1325 

Gly Phe Cys Asp Asn Thr Asp Gly Ser Phe Arg Cys Leu Cys Asp Gin 
1330 1335 1340 

Gly Phe Glu Thr Ser Pro Ser Gly Trp Glu Cys Val Asp Val Asn Glu 
1345 1350 1355 1360 



Cys Glu Leu Met Met Ala Val Cys Gly Asp Ala Leu Cys Glu Asn Val 
1365 1370 1375 



Glu Gly Ser Phe Leu Cys Leu Cys Ala Ser Asp Leu Glu Glu Tyr Asp 
1380 1385 1390 



Ala Glu Glu Gly His Cys Arg Pro Arg Val Ala Gly Ala Gin Arg He 
1395 1400 1405 

Pro Glu Val Arg Thr Glu Asp Gin Ala Pro Ser Leu He Arg Met Glu 
1410 1415 1420 

Cys Tyr Ser Glu His Asn Gly Gly Pro Pro Cys Ser Gin He Leu Gly 
1425 1430 1435 1440 

Gin Asn Ser Thr Gin Ala Glu Cys Cys Cys Thr Gin Gly Ala Arg Trp 
1445 1450 1455 

Gly Lys Ala Cys Ala Pro Cys Pro Ser Glu Asp Ser Val Glu Phe Ser 
1460 1465 1470 

Gin Leu Cys Pro Ser Gly Gin Gly Tyr He Pro Val Glu Gly Ala Trp 
1475 1480 1485 

Thr Phe Gly Gin Thr Met Tyr Thr Asp Ala Asp Glu Cys Val Leu Phe 
1490 ' 1495 1500 

Gly Pro Ala Leu Cys Gin Asn Gly Arg Cys Ser Asn He Val Pro Gly 
1505 1510 1515 1520 

Tyr He Cys Leu Cys Asn Pro Gly Tyr His Tyr Asp Ala Ser Ser Arg 
1525 1530 1535 

Lys Cys Gin Asp His Asn Glu Cys Gin Asp Leu Ala Cys Glu Asn Gly 
1540 1545 1550 

Glu Cys Val Asn Gin Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro 
1555 1560 1565 

Leu Thr Leu Asp Leu Ser Gly Gin Arg Cys Val Asn Thr Thr Ser Ser 
1570 1575 1580 

Thr Glu Asp Phe Pro Asp His Asp He His Met Asp He Cys Trp Lys 
1585 1590 1595 1600 

Lys Val Thr Asn Asp Val Cys Ser Gin Pro Leu Arg Gly His His Thr 
1605 1610 1615 

Thr Tyr Thr Glu Cys Cys Cys Gin Asp Gly Glu Ala Trp Ser Gin Gin 
1620 1625 1630 

Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gin Leu Cys 
1635 1640 1645 



Asn Val Ala Arg He Glu Ala Glu Arg Gly Ala Gly He His Phe Arg 
1650 1655 1660 



Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu 
1665 1670 1675 1680 



Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp 
1685 1690 1695 



Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gin Pro Gly Asp 
1700 1705 1710 

Asn Thr Pro Val Leu Glu Pro Pro Leu Gin Pro Ser Glu Leu Gin Pro 
1715 1720 1725 

His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu 
1730 1735 1740 

Gin Ala Glu Glu Cys Gly lie Leu Asn Gly Cys Glu Asn Gly Arg Cys 
1745 1750 1755 1760 

Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gin 
1765 1770 1775 

Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp 
1780 1785 1790 

Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr 
1795 * 1800 1805 

Glu Gly Ser Tyr Arg Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pro 
1810 1815 1820 

Gly Pro Pro His Cys Ala Ala Lys Glu 
1825 1830 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: i.,37 59 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATG CGC CAG GCC GGC GGA TTG GGG CTG CTG GCA CTA CTC CTG CTG GCG 
Met Arg Gin Ala Gly Gly Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala 
15 10 15 

CTG CTG GGC CCC GGC GGC CGA GGG GTG GGC CGG CCG GGC AGC GGG GCA 
Leu Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala 
20 25 30 



CAG GCG GGG GCG GGG CGC TGG GCC CAA CGC TTC AAG GTG GTC TTT GCG 
Gin Ala Gly Ala Gly Arg Trp Ala Gin Arg Phe Lys Val Val Phe Ala 
35 40 45 



CCT GTG ATC TGC AAG CGG ACC TGT CTG AAG GGC CAG TGT CGG GAC AGC 
Pro Val He Cys Lys Arg Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser 
50 55 60 

TGT CAG CAG GGC TCC AAC ATG ACG CTC ATC GGA GAG AAC GGC CAC AGC 
Cys Gin Gin Gly Ser Asn Met Thr Leu He Gly Glu Asn Gly His Ser 
65 70 75 80 



ACC GAC ACG CTC ACC GGT TCT GCC TTC CGC GTG GTG GTG TGC CCT CTA 
Thr Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu 
85 90 95 



CCC TGC ATG AAC GGT GGC CAG TGC TCT TCC CGA AAC CAG TGC CTG TGT 
Pro Cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys 
100 105 110 



CCC CCG GAT TTC ACG GGG CGC TTC TGC CAG GTG CCT GCT GCA GGA ACC 
Pro Pro Asp Phe Thr Gly Arg Phe Cys Gin Val Pro Ala Ala Gly Thr 
115 120 125 

GGA GCT GGC ACC GGG AGT TCA GGC CCC GGC TGG CCC GAC CGG GCC ATG 
Gly Ala Gly Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met 
130 - 135 140 

TCC ACA GGC CCG CTG CCG CCC CTT GCC CCA GAA GGA GAG TCT GTG GCT 
Ser Thr Gly Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala 
145 150 155 160 



AGC AAA CAC GCC ATT TAC GCG GTG CAG GTG ATC GCA GAT CCT CCC GGG 
Ser Lys His Ala He Tyr Ala Val Gin Val He Ala Asp Pro Pro Gly 
165 170 175 

CCG GGG GAG GGT CCT CCT GCA CAA CAT GCA GCC TTC TTG GTG CCC CTG 
Pro Gly Glu Gly Pro Pro Ala Gin His Ala Ala Phe Leu Val Pro Leu 
180 185 190 

GGG CCA GGA CAA ATC TCG GCA GAA GTG CAG GCT CCG CCC CCC GTG GTG 
Gly Pro Gly Gin He Ser Ala Glu Val Gin Ala Pro Pro Pro Val Val 
195 200 205 



AAC GTG CGT GTC CAT CAC CCT CCT GAA GCT TCC GTT CAG GTG CAC CGC 
Asn Val Arg Val His His Pro Pro Glu Ala Ser Val Gin Val His Arg 
210 215 220 

ATC GAG GGG CCG AAC GCT GAA GGC CCA GCC TCT TCC CAG CAC TTG CTG 
He Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gin His Leu Leu 
225 230 235 240 



CCG CAT CCC AAG CCC CCG CAC CCG AGG CCA CCC ACT CAA AAG CCA CTG 
Pro His Pro Lys Pro Pro His Pro Arg Pro Pro Thr Gin Lys Pro Leu 
245 250 255 

GGC CGC TGC TTC CAG GAC ACA TTG CCC AAG CAG CCT TGT GGC AGC AAC 
Gly Arg Cys Phe Gin Asp Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn 
260 265 270 



CCT TTG CCT GGC CTT ACC AAG CAG GAA GAT TGC TGC GGT AGC ATC GGT 864 
Pro Leu Pro Gly Leu Thr Lys Gin Glu Asp Cys Cys Gly Ser lie Gly 
275 280 285 

ACT GCC TGG GGA CAA AGC AAG TGT CAC AAG TGC CCA CAG CTT CAG TAT 912 
Thr Ala Trp Gly Gin Ser Lys Cys His Lys Cys Pro Gin Leu Gin Tyr 
290 295 300 

ACA GGG GTG CAG AAG CCT GTA CCT GTA CGT GGG GAG GTG GGT GCT GAC 960 
Thr Gly Val lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp 

305 310 315 320 

TGC CCC CAG GGC TAC AAG AGG CTC AAC AGC ACC CAC TGC CAG GAT ATC 1008 
Cys Pro Gin Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gin Asp lie 
325 330 335 

AAC GAA TGT GCG ATG CCC GGG AAT GTG TGC CAT GGT GAC TGC CTC AAC 1056 
Asn Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn 
340 345 350 

AAC CCT GGC TCT TAT CGC TGT GTC TGC CCG CCC GGT CAT AGC TTG GGT 1104 
Asn Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly 
355 360 365 

CCC CTC GCA GCA CAG TGC ATT GCC GAC AAA CCA GAG GAG AAG AGC CTG 1152 
Pro Leu Ala Ala Gin Cys He Ala Asp Lys Pro Glu Glu Lys Ser Leu 
370 375 380 

TGT TTC CGC CTT GTG AGC ACC GAA CAC CAG TGC CAG CAC CCT CTG ACC 1200 
Cys Phe Arg Leu Val Ser Thr Glu His Gin Cys Gin His Pro Leu Thr 
385 390 395 400 

ACA CGC CTA ACC CGC CAG CTC TGC TGC TGT AGT GTG GGT AAA GCC TGG 124 8 

Thr Arg Leu Thr Arg Gin Leu Cys Cys Cys Ser Val Gly Lys Ala Trp 
405 410 415 

GGT GCC CGG TGC CAG CGC TGC CCG GCA GAT GGT ACA GCA GCC TTC AAG 12 96 

Gly Ala Arg Cys Gin Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys 
420 425 430 

GAG ATC TGC CCC GGC TGG GAA AGG GTA CCA TAT CCT CAC CTC CCA CCA 1344 
Glu He Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro 
435 440 445 

GAC GCT CAC CAT CCA GGG GGA AAG CGA CTT CTC CCT CTT CCT GCA CCC 1392 
Asp Ala His His Pro Gly Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro 
450 455 460 

GAC GGG CCA CCC AAA CCC CAG CAG CTT CCT GAA AGC CCC AGC CGA GCA 14 40 

Asp Gly Pro Pro Lys Pro Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala 
465 470 475 480 

CCA CCC CTC GAG GAC ACA GAG GAA GAG AGA GGA GTG ACC ATG GAT CCA 1488 
Pro Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro 
485 490 495 

CCA GTG AGT GAG GAG CGA TCG GTG CAG CAG AGC CAC CCC ACT ACC ACC 15 36 



Pro Val Ser Glu Glu Arg Ser Val Gin Gin Ser His Pro Thr Thr Thr 
500 505 510 

ACC TCA CCC CCC CGG CCT TAC CCA GAG CTC ATC TCT CGC CCC TCC CCA 
Thr Ser Pro Pro Arg Pro Tyr Pro Glu Leu lie Ser Arg Pro Ser Pro 
515 520 525 

CCT ACC TTC CAC CGG TTC CTG CCA GAC TTG CCC CCA TCC CGA AGT GCA 
Pro Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala 
530 535 540 

GTG GAG ATC GCC JCC ACT CAG GTC ACA GAG ACC GAT GAG TGC CGA TTG 
Val Glu lie Ala Pro Thr Gin Val Thr Glu Thr Asp Glu Cys Arg Leu 
545 550 555 560 

AAC CAG AAT ATC TGT GGC CAT GGA CAG TGT GTG CCT GGC CCC TCG GAT 
Asn Gin Asn lie Cys Gly His Gly Gin Cys Val Pro Gly Pro Ser Asp 
565 570 575 

TAC TCC TGC CAC TGC AAC GCT GGC TAC CGG TCA CAC CCG CAG CAC CGC 
Tyr Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gin His Arg 
580 585 590 

TAC TGT GTT GAT GTG AAC GAG TGC GAG GCA GAG CCC TGC GGC CCC GGG 
Tyr Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly 
595 600 605 

AAA GGC ATC TGT ATG AAC ACT GGT GGC TCC TAC AAT TGT CAC TGC AAC 
Lys Gly lie Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn 
610 615 620 

CGA GGC TAC CGC CTC CAC GTG GGT GCA GGG GGC CGC TCG TGC GTG GAC 
Arg Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp 
625 630 635 640 

CTG AAC GAG TGC GCC AAG CCT CAC CTG TGT GGG GAC GGT GGC TTC TGC 
Leu Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys 
645 650 655 

ATC AAC TTC CCT GGT CAC TAC AAA TGC AAC TGC TAT CCT GGC TAC CGG 
He Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg 
660 665 670 

CTC AAG GCC TCC CGA CCG CCC ATT TGC GAA GAC ATC GAC GAG TGT CGC 
Leu Lys Ala Ser Arg Pro Pro He Cys Glu Asp He Asp Glu Cys Arg 
675 680 685 

GAC CCT AGC ACC TGC CCT GAT GGC AAA TGT GAA AAC AAA CCT GGC AGC 
Asp Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser 
690 695 700 

TTC AAG TGC ATC GCC TGC CAG CCT GGC TAC CGT AGC CAG GGG GGC GGG 
Phe Lys Cys He Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly 
705 710 715 720 



GCC TGT CGT GAT GTC AAC GAA TGC TCC GAG GGT ACC CCC TGC TCT CCT 
Ala Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro 



725 



730 



735 



GGA TGG TGT GAG AAC CTT CCG GGT TCT TAG CGT TGC ACG TGT GCC CAG 2256 
Gly Trp Cys Glu Asn Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin 
740 745 750 

GGG ATA CGA ACC CGC ACA GGA CGC CTC AGT TGC ATA GAC GTG GAT GAG 2304 
Gly He Arg Thr Arg Thr Gly Arg Leu Ser Cys He Asp Val Asp Glu 
755 760 765 

TGT GAG GCT GGG AAA GTG TGC CAA GAT GGC ATC TGC ACG AAC ACA CCA 2352 
Cys Glu Ala Gly ^ys Val Cys Gin Asp Gly He Cys Thr Asn Thr Pro 
770 775 780 

GGC TCT TTC CAG TGT CAG TGC CTC TCC GGC TAT CAT CTG TCA AGG GAT 2 4 00 

Gly Ser Phe Gin Cys Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp 
785 790 795 800 

CGG AGC CGC TGT GAG GAC ATT GAT GAA TGT GAC TTC CCT GCG GCC TGC 24 4 8 

Arg Ser Arg Cys Glu Asp He Asp Glu Cys Asp Phe Pro Ala Ala Cys 
805 810 815 

ATC GGG GGT GAC TGC ATC AAT ACC AAT GGT TCC TAC AGA TGT CTC TGT 24 96 

He Gly Gly Asp Cys He Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys 
820 825 830 

CCC CTG GGT CAT CGG TTG GTG GGC GGC AGG AAG TGC AAG AAA GAT ATA 2 544 

Pro Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp He 
835 840 845 



m 

m GAT GAG TGC AGC CAG GAC CCA GGC CTG TGC CTG CCC CAT GCC TGC GAG 25 92 

L. Asp Glu Cys Ser Gin Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu 
U 850 855 860 

|J1 

U AAC CTC CAG GGC TCC TAT GTC TGT GTC TGT GAT GAG GGT TTC ACA CTC 2 640 

Hi Asn Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu 
0 865 870 875 880 

^ ACC CAG GAC CAG CAT GGG TGT GAG GAG GTG GAG CAG CCC CAC CAC AAG 2 68 8 

Thr Gin Asp Gin His Gly Cys Glu Glu Val Glu Gin Pro His His Lys 
885 890 895 

AAG GAG TGC TAC CTT AAC TTC GAT GAC ACA GTG TTC TGT GAC AGC GTA 27 36 

Lys Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val 
900 905 910 

TTG GCT ACC AAT GTC ACT CAG CAG GAA TGC TGT TGC TCT CTG GGA GCT 27 84 

Leu Ala Thr Asn Val Thr Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala 
915 920 925 

GGC TGG GGA GAC CAC TGC GAA ATC TAT CCC TGT CCA GTC TAC AGC TCA 2832 
Gly Trp Gly Asp His Cys Glu He Tyr Pro Cys Pro Val Tyr Ser Ser 
930 935 940 

GCC GAA TTT CAC AGC CTG GTG CCT GAT GGG AAA AGG CTA CAC TCA GGA 288 0 

Ala Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly 
945 950 955 960 



CAA CAA CAT TGT GAA CTA TGC ATT CCT GCC CAC CGT GAC ATC GAC GAA 2928 
Gin Gin His Cys Glu Leu Cys lie Pro Ala His Arg Asp lie Asp Glu 
965 970 975 

TGC ATA TTG TTT GGG GCA GAG ATC TGC AAG GAG GGC AAG TGT GTG AAC 2 97 6 

Cys He Leu Phe Gly Ala Glu He Cys Lys Glu Gly Lys Cys Val Asn 
980 985 990 

ACG CAG CCC GGC TAC GAG TGC TAG TGC AAG CAG GGC TTC TAC TAC GAT 3024 
Thr Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp 
995 1000 1005 

GGC AAC CTG CTG GAG TGC GTG GAC GTG GAT GAG TGC TTG GAT GAG TCT 3072 
Gly Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser 
1010 1015 1020 

AAC TGC AGG AAC GGA GTG TGT GAG AAC ACA CGT GGC GGC TAC CGC TGT 3120 
Asn Cys Arg Asn Gly Val Cys Glu Asn Thr Arg Gly Gly Tyr Arg Cys 
1025 1030 1035 1040 

GCC TGC ACT CCG CCG GCA GAG TAC AGT CCC GCA CAG GCC CAG TGT CTG 3168 
% Ala Cys Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu 
^ - 1045 1050 1055 

gut; 

&ji ATC CCG GAG AGA TGG AGC ACG CCC CAG AGA GAC GTG AAG TGT GCT GGG 3216 

Hi He Pro Glu Arg Trp Ser Thr Pro Gin Arg Asp Val Lys Cys Ala Gly 

01 1060 1065 1070 

iUL-3 

HI GCC AGC GAG GAG AGG ACG GCA TGT GTA TGG GGC CCC TGG GCG GGA CCT 32 64 

I" Ala Ser Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro 
1075 1080 1085 

GCC CTC ACT TTT GAT GAC TGC TGC TGC CGC CAG CCG CGG CTG GGT ACC 3312 
*f Ala Leu Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr 
flj 1090 1095 1100 

Q CAG TGC AGA CCG TGC CCG CCA CGT GGC ACC GGG TCC CAG TGC CCG ACT 33 60 

Gin Cys Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr 
1105 1110 1115 1120 

TCA CAG AGT GAG AGC AAT TCT TTC TGG GAC ACA AGC CCC CTG CTA CTG 34 08 

Ser Gin Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu 
1125 1130 1135 

GGG AAG TCT CCG CGA GAC GAA GAC AGC TCA GAG GAG GAT TCA GAT GAG 34 56 

Gly Lys Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu 
1140 1145 1150 

TGC CGT TGT GTG AGC GGA CGC TGT GTG CCA CGG CCA GGC GGG GCG GTA 350 4 

Cys Arg Cys Val Ser Gly Arg Cys Val Pro Arg Pro Gly Gly Ala Val 
1155 1160 1165 

TGC GAG TGT CCT GGA GGC TTT CAG CTG GAC GCC TCC CGT GCC CGC TGC 3552 
Cys Glu Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys 
1170 1175 1180 



GTG GAC ATT GAT GAG TGC CGA GAA CTG AAC CAG CGG GGA CTG CTG TGT 3600 
Val Asp He Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys 
1185 1190 1195 1200 

AAG AGC GAG CGG TGC GTG AAC ACC AGT GGA TCC TTC CGC TGT GTC TGC 364 8 

Lys Ser Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys 
1205 1210 1215 

AAA GCT GGC TTC ACG CGC AGC CGC CCT CAC GGG CCT GCG TGC CTC AGC 3696 
Lys Ala Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser 
1220 1225 1230 

GCC GCC GCT GAT GAT GCA GCC ATA GCC CAC ACC TCA GTG ATC GAT CAT 374 4 

Ala Ala Ala Asp Asp Ala Ala He Ala His Thr Ser Val He Asp His 
1235 1240 1245 

CGA GGG TAT TTT CAC 3759 
Arg Gly Tyr Phe His 
1250 



m (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 1253 amino acids 

CI (B) TYPE: amino acid 

fli (D) TOPOLOGY: linear 

£0 (ii) MOLECULE TYPE: protein 

* s (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

!S Met Arg Gin Ala Gly Gly Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala 

CP 15 10 15 

fU Leu Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala 
Q 20 25 30 

Gin Ala Gly Ala Gly Arg Trp Ala Gin Arg Phe Lys Val Val Phe Ala 
35 40 45 

Pro Val He Cys Lys Arg Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser 
50 55 60 

Cys Gin Gin Gly Ser Asn Met Thr Leu He Gly Glu Asn Gly His Ser 
65 70 75 80 

Thr Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu 
85 90 95 

Pro Cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys 
100 105 HO 

Pro Pro Asp Phe Thr Gly Arg Phe Cys Gin Val Pro Ala Ala Gly Thr 
115 120 125 

Gly Ala Gly Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met 



130 



135 



140 



Ser Thr Gly Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala 
145 150 155 160 

Ser Lys His Ala He Tyr Ala Val Gin Val He Ala Asp Pro Pro Gly 
165 170 175 

Pro Gly Glu Gly Pro Pro Ala Gin His Ala Ala Phe Leu Val Pro Leu 
180 185 190 

Gly Pro Gly Gin He Ser Ala Glu Val Gin Ala Pro Pro Pro Val Val 
195 200 205 

Asn Val Arg Val His His Pro Pro Glu Ala Ser Val Gin Val His Arg 
210 215 220 

He Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gin His Leu Leu 
225 230 235 240 

Pro His Pro Lys Pro Pro His Pro Arg Pro Pro Thr Gin Lys Pro Leu 
245 250 ' 255 

Gly Arg Cys Phe Gin Asp Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn 
260 265 270 

Pro Leu Pro Gly Leu Thr Lys Gin Glu Asp Cys Cys Gly Ser He Gly 
275 280 285 

Thr Ala Trp Gly Gin Ser Lys Cys His Lys Cys Pro Gin Leu Gin Tyr 
290 295 300 

Thr Gly Val Gin Lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp 
305 310 315 320 

Cys Pro Gin Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gin Asp He 
325 330 335 

Asn Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn 
340 345 350 

Asn Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly 
355 360 365 

Pro Leu Ala Ala Gin Cys He Ala Asp Lys Pro Glu Glu Lys Ser Leu 

370 375 380 

Cys Phe Arg Leu Val Ser Thr Glu His Gin Cys Gin His Pro Leu Thr 
385 390 395 400 

Thr Arg Leu Thr Arg Gin Leu Cys Cys Cys Ser Val Gly Lys Ala Trp 
405 410 415 



Gly Ala Arg Cys Gin Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys 
420 425 430 



Glu He Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro 



435 



440 



445 



Asp Ala His His Pro Gly Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro 
450 455 460 

Asp Gly Pro Pro Lys Pro Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala 
465 470 475 480 

Pro Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro 
485 490 495 

Pro Val Ser Glu Glu Arg Ser Val Gin Gin Ser His Pro Thr Thr Thr 
500 505 510 

Thr Ser Pro Pro Arg Pro Tyr Pro Glu Leu lie Ser Arg Pro Ser Pro 
515 520 525 

Pro Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala 
530 535 540 

Val Glu lie Ala Pro Thr Gin Val Thr Glu Thr Asp Glu Cys Arg Leu 
545 550 555 560 

Asn Gin Asn lie Cys Gly His Gly Gin Cys Val Pro Gly Pro Ser Asp 
565 570 575 

Tyr Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gin His Arg 
580 585 590 

Tyr Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly 
595 600 605 

Lys Gly lie Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn 
610 615 620 

Arg Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp 
625 630 635 640 

Leu Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys 
645 650 655 

lie Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg 
660 665 670 

Leu Lys Ala Ser Arg Pro Pro lie Cys Glu Asp lie Asp Glu Cys Arg 
675 680 685 

Asp Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser 
690 695 700 

Phe Lys Cys lie Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly 
705 710 715 720 



Ala Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro 
725 730 735 



Gly Trp Cys Glu Asn Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin 



740 



745 



750 



Gly lie Arg Thr Arg Thr Gly Arg Leu Ser Cys lie Asp Val Asp Glu 
755 760 765 

Cys Glu Ala Gly Lys Val Cys Gin Asp Gly He Cys Thr Asn Thr Pro 
770 775 780 

Gly Ser Phe Gin Cys Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp 
785 790 795 800 

Arg Ser Arg Cy. Glu Asp He Asp Glu Cys Asp Phe Pro Ala Ala Cys 
805 810 815 

He Gly Gly Asp Cys He Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys 
820 825 830 

Pro Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp He 
835 840 845 

Asp Glu Cys Ser Gin Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu 
850 855 860 

Asn Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu 
865 870 875 880 

Thr Gin Asp Gin His Gly Cys Glu Glu Val Glu Gin Pro His His Lys 
885 890 895 

Lys Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val 
900 905 910 

Leu Ala Thr Asn Val Thr Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala 
915 920 925 

Gly Trp Gly Asp His Cys Glu He Tyr Pro Cys Pro Val Tyr Ser Ser 
930 935 940 

Ala Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly 
945 950 955 960 

Gin Gin His Cys Glu Leu Cys He Pro Ala His Arg Asp He Asp Glu 
965 970 975 

Cys He Leu Phe Gly Ala Glu He Cys Lys Glu Gly Lys Cys Val Asn 
980 985 990 

Thr Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp 
995 1000 1005 

Gly Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser 
1010 1015 1020 



Asn Cys Arg Asn Gly Val Cys Glu Asn Thr Arg Gly Gly Tyr Arg Cys 
1025 1030 1035 1040 



Ala Cys Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu 



1045 



1050 



1055 



He Pro Glu Arg Trp Ser Thr Pro Gin Arg Asp Val Lys Cys Ala Gly 
1060 1065 1070 

Ala Ser Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro 
1075 1080 1085 

Ala Leu Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr 
1090 1095 1100 

Gin Cys A ;g Pro Cys Pro P. o Arg Gly rhx Gly jer Gin Cys Pro Thi 
1105 1110 1115 1120 

Ser Gin Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu 
1125 1130 1135 

Gly Lys Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu 
1140 1145 1150 

Cys Arg Cys Val Ser Gly Arg Cys Val Pro Arg Pro Gly Gly Ala Val 
1155 1160 1165 

Cys Glu Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys 
1170 1175 1180 

Val Asp He Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys 
1185 1190 1195 1200 

Lys Ser Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys 
1205 1210 1215 

Lys Ala Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser 
1220 1225 1230 

Ala Ala Ala Asp Asp Ala Ala He Ala His Thr Ser Val He Asp His 
1235 1240 1245 



Arg Gly Tyr Phe His 
1250 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Gly Glu Ser Val Ala Ser Lys His Ala He Tyr Ala Val Cys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
TACCGATGCT AOOGCAGCAA TCTT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATGCCTAAAC TCTACCAGCA CG 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GAGTCACGTC ATCCATTCCA CA 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGTCCAAGTT GTGTCTTAGC AG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Pro Pro Gly Pro Gin Gly Ala Thr Gly Pro Leu Gly Pro Lys Gly 
15 10 15 

Gin Thr Gly Glu Pro Gly He Ala Gly Phe Lys Gly Glu Gin Gly Pro 
20 25 30 

Lys Gly Glu Thr Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Pro Ala 
31 40 45 

Gly Glu Glu Gly Lys 
50 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ^ TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGCCCTCCCG GTCCTCAAGG TGCAACTGGT CCTCTGGGCC CCAAAGGTCA GACGGGTGAG 
CCCGGCATCG CTGGCTTCAA AGGTGAACAA GGCCCCAAGG GAGAGACTGG ACCTGCTGGG 
CCCCAGGGAG CCCCTGGCCC TGCTGGTGAA GAAGGAAAA 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /note- "N = A or G or C or T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

AAACGTCACA CGTGANACGT GAACGTTGCT TGCTGG 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTACGTCCAC GTACACGTCT AGCAAGCAAG CA 32 
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